General Disclaimer 


One or more of the Following Statements may affect this Document 


• This document has been reproduced from the best copy furnished by the 
organizational source. It is being released in the interest of making available as 
much information as possible. 


• This document may contain data, which exceeds the sheet parameters. It was 
furnished in this condition by the organizational source and is the best copy 
available. 


• This document may contain tone-on-tone or color graphs, charts and/or pictures, 
which have been reproduced in black and white. 


• This document is paginated as submitted by the original source. 


• Portions of this document are not fully legible due to the historical nature of some 
of the material. However, it is the best reproduction available from the original 
submission. 


Produced by the NASA Center for Aerospace Information (CASI) 



TECHNICAL REPORT 


(MASA-CR-156930) AN ADAPTIVE LEARNING N78-25966 

CONTBOL SISTEH FOB AI8CBAFT (City Coll, of 
the City Oniv. of New lock.) 222 p RC 

A10/HF kOI CSCl 05A Onclas 

G3/81 21573 


AN ADAPTIVE LEARNING CONTROL SYSTEM F'JR AIRCRAFT 

by 

Ralph Makel and Solcaon Nachmlas 


Prepared For 



Fligh*; Dynamics and Control Division 
Langley Research Center 

National Aeronautics and Space Administration 
Hampton, Virginia 23365 



Grant No, NSG-1169 



THE CITY UNIVERSITY ol NEW YORK 


m 




E 

I 

I 

I 


TECHNICAL REPORT 

0ASA-CE-15693O) AN ADAPTIVE LEARNING 
CONTROL SYSTEM EOR AIRCRAFT ?City Coll, of 
the City Ouiv. of New York.) 222 p HC 
A10/MF i01 CSCL 05 A 


N78-25966 


Unclas 

G3/81 21573 




AN ADAPTIVE LEARNING CONTROL SYSTEM FOR AIRCRAFT 


by 


Ralph Mekel and Solomon Nachmias 


Prepared For 


I 


Flight Dynamics and Control Division 
Langley Research Center 

National Aeronautics and Space Administration 
Hampton/ Virginia 23365 






fr-V 

V- ’ 


Grant No*. NSG-1169 
April 197S 





THE CITY 

THE CITY UNIVERSITY of NEW YORK 






THE CITY COLLEGE OF THE CITY UNIVERSITY OF NEW YORK 
New York, N. Y. 10031 



AN ADAPTIVE LEARNING CONTROL SYSTEM FOR AIRCRAFT 

by 

Ralph Mekel and Solomon Nachmias 


Grant No. NSG-1169 
April 1978 


Prepared For 


Flight Dynamics and Control Division 
Langley Research Center 
National Aeronautics and Space Administration 
Hampton, Virginia 23365 


FOREWORD 


This report was prepared vmder NASA grant NSG-1169. The report 
describes a method for designing an adaptive learning control system 
for aircraft. 

The research was conducted under the direction of Professor Ralph 
Mekel v.’ho was assisted by a graduate student, Solomon Nachmias, supported 
under tt?e grant. Professor Mekel and Mr. Nachmias are with the City 
College of the City University of New York. Dr. Raymond C. Montgomery, 
NASA, Langley Research Center, was the technical monitor of this research 


program. 


ACKN01.VLEDGMENTS 


The authors wish to thank Dr. Raymond C. 1-tontgomery, NASA, Langley 
Research Center for his helpful suggestions and informative discussions 
about aircraft dynamics which provided insight into the six-degree-of- 
freedom simulation of the P-8 DFBW aircraft. 

We also wish to thank Mr. William H. Phillips and Dr. Raymond C. 
Montgomery for allowing us the use of NASA's real-time simulator. Thanks 
are also due to Dr. John D. Bird, Mr. Jarrell R. Elliott and Mr. Martin 
T. Moul for their many technical recommendations. 


AN ADAPTIVE LEARNING CONTROL SYSTEM FOR AIRCRAFT 


by 

Ralph Mekel and Solomon Nachmias 
ABSTRACT 

This dissertation presents the formulation of a learning control 
system and investigates its utilization as a flight control system for 
NASA's F-8 Digital Ply-By-V/ire (DFBW) research aircraft. The study takes 
the l>est features of two methods, i.e. the gain scheduling and the adaptive 
control method, and attempts to eliminate the undesirable features of each. 
One of the characteristics of this learning control system is its ability 
to adjust a gain schedule in a prescribed manner to account for changing 
plant characteristics. Another important feature of this learning system 
is that it can improve its performance 4nd the plant' s performance in the 
coiucse of its own operation. 

The adaptive learning control system consists of three subsystems: 

1) The Information Acquisition Subsystem which identifies the plant's 
parameters at a given oper-ating condition. The mathematical technique is 
based upon a model-reference system cohfiguration where the adaptive algo- 
rithm used to update the model's parameters are derived using both Liapu- 
nov's direct method and the Newton-Raphson method. 2) The Learning Algo- 
rithm Subsystem which relates the identified parameters to predetermined 
analytical expressions describing the behavior of the parameters over a 
range of operating conditions. The mathematical technique is based upon 

a sequential coefficient estimation derived using the least-square algo- 

\» 

rithm. 3) The Memory and Control Process Subsystem which consists of the 
collection of updated coefficients (memory) and the control laws derived 

iv 


by imposing desired plant performance characteristics. 

An artificial plant is presented in order to illustrate the perfor- 
mance of the learning control system and to compare two different designs 
based upon two different information acquisition subsystems: 1) Liapunov's 

method and 2) Newton-Raphson method. For this case the models Involved 
consisted of the uncoupled, linear time -invariant open-loop longitudinal 
and lateral dynamics of the F-8 aircraft associated with equilibrium flight. 

m applying the learning control system to the piloted six-degree- 
of -freedom simulation of the F-8 aircraft, reduced order discrete models 
were used for the information acquisition subsystem to conform v/ith the 
real-time constraint. Simulation experiments indicate that the learning 
control system was effective in compensating for parameter variations 
caused by changes in flight conditions over the entire flight envelope. 
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CHAPTER 1 
INTRODUCTION 

1.1 Statement of Problem 

This dissertation presents the , f orroulation of a learning control 
system. The problem that motivated the development of this learning 
control system was the desire to maintain uniform handling qualities of 
an aircraft over the entire flight envelope despite wide variations of 
dynamic pressure and other less predictable changes. 

A Learning Control System (LCS) has been developed to accomplish 
this objective by adjusting, in a prescribed manner, the feedforward 
and feedback gains of the aircraft control system. The problem of de- 
signing the LCS v;as divided into the design problem of three basic sub- 
systems that constitute the LCS. 1. The Information Acquisition Sub- 
system (IAS), 2. The Learning Algorithm Subsystem (LAS) and 3. The 
r-iemory and Control Process Subsystem (MCPS) . Two techniques are de- 
veloped jfor designing the IAS. The first technique makes use of Lia- 
punov' s direct method and the second technique is based on Newton- 
Raphson method. Each IAS may be combined with the LAS and MCPS to form 
a LCS. The LAS is designed using a sequential coefficient estimator 
technique v/hich is based on iterative least-square algorithms. The KCPS 
consists of the collection of updated coefficients which are used to 
compute the feedfonvard and feedback gains via a set of control laws. 

The next section describes a brief summary of prior work in this 
area. A more extensive description ' of prior work in adaptive and/or 
learning control systems is given in Chapter 2. 
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1.2 Summair/ of Prior V/ork 

Over the years the need for adaptive control on aircraft has 
been debated. The alternative of scheduling autopilot gains according 
to sensed pressure, altitude and Mach number appears to work v/ell for 
most aircraft and is relatively simple to implement. This gain sched- 
uling has the advantage of allowing rapid changes in gains and feedback 
paths as a ftanction of flight condition and vehicle configuration but 
suffers from the rather precise knowledge of the system dynamics re- 
quired to establish a workable gain schedule. This is one of the 
reasons for the surge of interest in systems that automatically adjust 
feedback gains as a function of aircraft stability and control charac- 
teristics evaluated on-line in flight. These adaptive systems have 
not gained wide acceptance. The X-15 aircraft, to our knowledge, was 

the only previous aircrsift that had an adaptive control system. Although 

(93) 

it worked reasonably v/ell, there v/ere difficulties. In designing . 

adaptive control systems most important is the problem of guaranteeing 

stability of the adaptive and control loops of the system under oper- 
(42 59) 

ation ’ ^ Still another practical problem is the frequent need 

for a dithering signal to excite the system during periods of control 
inactivity. The signal must be subliminal to the pilot and crew. The 
need to limit the amplitude of this signal usually requires that the 
adaptive loops operate with a most unfavorable signal -to -noise ratio. 

The problems of the existing adaptive techniques indicate that the sys- 

i 

tern should allow gain adjustments to be made only when the adaptive 
system passed certain tests. This restriction, however, may preclude 
rapid gain adjustments and if such adjustments are required, only a gain 
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scheduling system would be able to handle the problem. Seemingly, the 

I 

next advance in flight control systems should take the best of the two 
worlds of gain scheduling and adaptive control and eliminate the un- 
desirable features of each# 

A suitable approach to design the information acquisition sub- 
system is the parameter adaptive modal-reference system. This scheme 
consists of a system, a model, and an adaptive algorithm for adjusting 
the model parameters, such that the model will converge asymptotically 

to an equivalent input-output representation of the unknown system. 

( 89) 

This approach ’.vas first considered by VJhitaker et al in their ap- 
plication to controlling the behavior of an aircraft. A more extensive 
summary of prior work on parameter adaptive model-reference systems is 
presented in section 2.2'. 

The learning algorithm subsystem introduces memory capabilities 
to the overall system thus, resulting in a self -organizing or learning 
control system. The learning system has the capability of changing its 
basic structure (including the adaptive logic and the performance in- 
dex) as a function of its experience and/or its environment. Such a 
system is usually expected to learn the solution to a control problem 
on-line, a process required because of, and despite, incomplete a priori 
knowledge of the plant and its environment. A more detailed description 
of prior v;ork In the field of learning systems is presented in section 
2.3. 

1.3 Summary of Results Obtained 

Qiapters 4 and 5 present the results obtained by the application 
of the learning control system to control the longitudinal and lateral 
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dynamics of the F-8 DFBW aircraft. 

In Chapter 4, the learning control system simulation is not con- 
cerned with real-time while the aircraft is in flight. For the modeling 
of the longitudinal and lateral dynamics of the plant, fourth order 
linearized representations are used associated with equilibrium flight. 

A comparison of two different learning control systems based upon two 
different information acquisition subsystems: 1) Liapunov method and 
2) Newton -Raphson method is also discussed in this chapter. The com- 
parison reveals that the Nevrton-Raphson information acquisition sub- 
system results in an overall superior performing learning control system. 

Our learning control system developed in Chapter 3 is applied in 
Chapter 5 to the piloted six-degree-of -freedom simulation of the F-8 
DFBVJ aircraft. Due to the real-time constraint, the information acqui- 
sition subsystem is implemented using reduced order models. I-tore spe- 
cifically, the longitudinal model describes only the short period longi- 
tudinal mode without considering the longitudinal phugoid mode because 
its time constant is normally large, and its definition is not a major 
element in adjusting the feedback gains of the primary control loops. 

For the same reason, only the lateral predominant modes are 
characterized by our model namely, the lateral roll mode and the lateral 
Dutch roll mode. The reduced order models generate a modeling error 
noise. Due to the introduced modeling error noise and to assure proper 
learning of the system parameters, v;e develop convergence and confidence 
criteria. 

■As shown in the results of the piloted six-degree-of-freedom 
simulation of the ?-8 DFBW aircraft, the handling qualities of the air- 
craft remain the same despite v;ide Vcuriation of flight conditions. This 
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In turn, implies that the learning control system may be applied to 
control plants whose parameters are functions of several variables. 

To do so, the same approach as presented In this dissertation may be 
followv_d with slight modifications depending upon the structure of the 
plant. 

The main difference between the learning system developed here 
and parameter adaptive systems of the type studied in reference is 

that the learning system recognizes patterns of the parameter estimates 
over the state space and parameterizes these estimates over that space 
for later use. For aircraft this implies that, for example, the param- 
eter estimate Ho< , partial derivative of the pitching moment with re- 
spect to angle of attack, be recognized as strongly dependent upon 
altitude and roach nximber (L,H) and that the explicit functional repre- 
sentation be determined for future use. Thus, if an aircraft has been 
flom previously at one (L,M) condition v;here a successful identifica- 
tion of was made, a learning system as described here, v/ould recall 
the previously learned value, whereas a parameter adaptive system would 
require another identification of Ho< • For the latter case auxiliary 
inputs must disturb the vehicle on a regular basis to facilitate con- 
tinual parameter identification. Alternatively, the learning system 
creates relationships among parameter estimates versus flight conditions 
and assembles these relationships into maps (memory) as information 
becomes available during normal flight v/ithout requiring auxiliary dis- 
turbance inputs. 

The learning control system is an improvement over conventional 
gain scheduling techniques because it introduces a significant having 
in storage and also makes practical the real time computation of the 
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parameters and the gains by a table look -up. As compared with conven- 
tional adaptive systems, the learning control system introduces an 
Improvement because it does not require a dither signal to excite the 
system during periods of control inactivity. The learning control 
system may also be viev/ed as a parameter estimation system. In this 
case, it is a global estimator over the entire flight envelope, thus 
providing information about parameters that are functions of several 
oth^ variables. To emphasize this aspect, we present in Chapter 5 
three dimensional curves vjrhich describe the variation of the parameters 
v/ith respect to altitude and mach number (L,M) as obtained from the 
learning control system. 

1.4 I‘lathenatical Formulation 

The mathematical formulation and functional organization of the 
learning control system is presented in Chapter 3 . Mathematical tech- 
niques used to design each subsystem are also developed in this chapter. 
Both Liapunov' s direct method and Nev/ton-Raphson method are used to 
design the adaptive algorithm of the information acquisition subsystem. 
These parameter adaptive approaches are found in sections 3.2 and 3.3 
respectively v;here we also develop two convergence criteria, one for 
each individual information acquisition subsystem, as an aid in deter- 
mining how much adaptation has taken place over the identification 
interval. This criterion may also be used as an aid in choosing the 
parameters of the adaptive algorithm based on the Liapunov' s direct 
method. The development of the learning algorithm subsystem presented 
in section 3.4 is based on an iterative least-square algorithm. Ih 
section 3.5 we describe the memory and control process subsystem which 
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consists of the collection of updated coefficients that are used to 
compute the feedfor'./ard and feedback gains via a set of control laws* 
Ihe next chapter presents a reviev/ of adaptive and/or learning 
systems and flight control systems* 
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CHAPTER 2 

HISTORICAL BACKGROUND 

2.1m Introduction 

This chapter presents a brief sununary of the state of the art of 
adaptive and/or learning control systems as applied to flight control 
systems • 

In general, adaptive and learning control systems represent an 
effort to extend the operating range of conventional control systems 
exposed to extraordinarily broad variations of enviroment or parameter 
values. If these variations are severe, it becomes necessary to alter 
certain system parameters to produce a satisfactory response over the 
entire range of operation. Furthermore, an adaptive and/or learning sys- 
tem also provides a means of continuously monitoring its own performance 
in relation to a given figure of merit (criterion) and thereby modify 
its own parameters by some action so as to approach the best performance. 

The advancements of flight control systems and the theory required 
to permit increased performance of future aircraft will be discussed in 
sections 2.4 and 2.5 respectively. 

Section 2.2 .will discuss adaptive systems, in particular, Parameter 
Adaptive I-5odel Reference Systems (PAMRS) . PAl-IRS happen to be the very 
popular adaptive systems because they are useful in various applications 
v/hich require rapid adaptation, for example, autopilots and the modeling 
of various systems such as socioeconomic or physiological systems. 

Section 2.3 will examine the theoretical state of the art of Learn- 
ing Control Systems (LCS) and the reasons for limited application of these 
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systems as apposed to adaptive control systems. 

The evolution of analog to digital flight control systems which 
has spread through avionics so that it now reaches the central core of 
the automatic flight control systems and the reasons that are attributed 
to this occurrence are examined in section 2.4. 

Finally, section 2.5 gives particular reference to the NASA P-8 
Digital Ply-By-Wire (DPBW) program. 


2.2. Parameter Adaptive Model Reference Systems (PAKRS) 

One of the conceptually simplest and most flexible methods for 

( 8 9 22) 

system identification is the model reference system approach^ ’ ' 

A )cnown input is simultaneously fed to the process and to a model with 
adjustable parameters. The adjustable parameters are changed by an ad- 
justment mechanism which receives the process output and the model output 
as inputs. If the algorithm for adjusting the parameters is designed 
properly, then the model-reference system output error will eventually 
vanish and the model will converge asymptotically to an equivalent input- 
output representation of the unknown system. Such schemes consisting of 
a system, a model, and an adaptive algorithm are usually referred to as 
parameter adaptive model-reference systems (PAI-1RS) . PAI-IRS are well suited 
for on-line identification. A basic feature of PAI^S is that they can 

also be used in adaptive control where the roles of the system and the 

, ^CS7,91) 

model are reversed 

I The PAMRS formulation of the on-line identification problem was 
first considered by V/hitaker et al^^^^ in their application to controlling 
the behavior of an aircraft. In their report, they derived the adaptive 
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algorithm based on the gradient of an even function of the error between 

model and reference system outputs. The error function chosen was very 

critical and convergence was not guaranteed. 

( 66 ) 

Osbum et al designed the PAMRS using a pexrformance index 
minimization method which has since been referred as the MIT design rule. 
The performance index is the integral squared of the response error. 

This rule has been very popular due to its simplicity in practical im- 
plementation. 

The requirement that the closed loop is stable is a necessary 
design criterion. Since the system consisting of the adjustable model, 
the process, and the adaptive algorithm is nonlinear the stability problem 
is not trivial. In order to guarantee system stability for all inputs 
one may use Liapunov's direct method to design the adaptive algorithm of 
PAI-P»S by selecting the design equations v;hich satisfy conditions derived 
from Liapunov's second method. Other methods are also available*. 

( 9 ) 

Eutchart and Sha^kcloth first suggested the use of a quadratic Lia- 

(67) 

punov function, which was employed later by Parks to redesign systems 

formerly designed by the KIT rule. However, Parks chooses the derivative 

of the Liapunov fxinction negative semidefinite so that if the error goes 

to zero, the parameters stop changing and consequently the design equa- 

(75) 

tions become input dependent. In a paper by Shahein et al this 
shortcoming is emphasized and an attempt is made to choose the derivative 
of the Liapunov function negative definite in error and parameter mis- 
alignment. Hox>rever, the obtained parameter adjusting equations are de- 
pendent on the derivatives of the input and unknown parameters. 


* See References (42, 43) 
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Practical applicability of the Liapunov designs of PAIIRS is lim- 
ited because disturbances and incomplete adaptation^ have a destabi- 
lizing effect on the system stability. A serious limitation occurs when 
the plant output measurements are corrupted with noise, leading to 
biasing in the adaptive loop, and to erroneous parameter compensation. 

A related problem is created when the state variables required in the 
Liapunov identification method have to be generated from noisy measure- 
ments. This last problem can be partly circumvented by reducing the 

order of the required state variable generator (SVG) ^ ^ by using the 

( 36 ) 

Kalman lemma ^ of a positive real transfer function, or by using re- 

( 67 ) 

duced state feedback . Another disadvantage is that the Liapunov 

design rule for adaptive control may not be applicable to cases where 

4he plant parameters cannot be directly adjusted. Such a case was men- 

( 91 ) 

tioned by v;insor et al and a solution, though quite complex, xi/as 

( 22 ) 

offered by Gilbert et al 

I'tost other designs of PAI-IRS are based on a gradient approach which 

usually results in systems that are only locally stable, and therefore, 

their convergence is only guaranteed if the state and the parameters of 

( 25 ) 

the model are alv/ays close to those of the plant's . This character- 
istic makes these approaches unreliable during the initial stages of the 
adaptive process v;hen the state and parameters of the plant and the model 
may not be close. For the same reasons, the rate of adaptation of these 
gr;adient methods has to be always kept very small. Different gradient- 
based nonlinear programming methods can be combined in a imified frame- 

v7ork and may be applied to the adaptation algorithm via the maximum 

( 25 ) 

likelihood principle . The use of maximum likelihood estimation In 

( 14 ) 

practice leads to difficult nonliriear programming problems 


In a lot 
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of cases the likelihood fiinction has inultiple maxima in the parameter 
space and the use of the gradient techniques may lead to convergence to 
the wrong stationary points* It is important to locate the absolute 
maximum of the likelihood function since it provides the unbiased esti- 
mate of the parameters. It should be clear that the difficulties in 

obtaining the absolute maximum of the likelihood function does not in- 

{ 14 ) 

validate the likelihood principle . The anomalies in the likelihood 

function are usually caused by either overparameterization ( leading to 

nearly singular information matrix) , or underparameterization ( leading 

( 14 ) 

to saddle points) . These problems have been considered by Edwards , 

Astrom and Bohlin^ ^ ^ , and Bohlin^ ^ ^ . 

The starting values for the iterations used to maximise the 

likelihood function are very important to ensure convergence to the 

♦ 

absolute mc:.:imum. The computation of the gradients of the likelihood 
fxnction with respect to the adjustable parameters usually require the 
knowledge of the sensitivities of the model states with respect to the 
parameters which is the most time consuming part in the adaptation algo- 
rithm. 


For this reason, research has been done on the computation of 

( 3 ) 

state sensitivities using reduced order models, Astrom and Eohlin 
have proposed techniques for the model reduction in the case of single- 
input single-output systems in a canonical form. For the multi-input 
multi -output case little research has been done to develop techniques 
to reduce the order of the model, except of finding bounds on the order 
of the model^^^’^°l 

The previous discussion suggests that one should consider and 
study the usefulness of other identification and modeling methods. One 
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of the recently more advcinced techniques for this purpose caii be considered 
to be the adaptive learning control system concept. Therefore, in the 
following section, background information about learning control systems 
is discussed. 

2.3. Learning Control Systems 

(18) 

Published literature on the theoretical state of the art of, 
learning control systems has grovci substantially within the past ten, 
years. Practical applications however, are not nearly so great in. number 
nor as widely distributed and read. This may be due to the cost and dif- 
ficulty of implementation and that previous levels of research and de- 
velopment efforts in this area have not been maintained as a result of 
the national economy. 

Thl*^ section attempts to give a brief background of the various 
types of learning control systems and to describe the present state of 
learning control applications through the use of both, mini and general 
purpose digital computers. 

The behavior or performance of certain types of advanced control 
systems have been labeled adaptive and leaming. Learning is most widely 
used and accepted as describing t;he appropriate use of past experience 
and the resulting improvement in overall system performance. 

A learning controller can be described as any control system that 
a) collects pertinent information (past and present), about the process 
it controls, and the environment it worlcs in and then b) processes this 
information to optimize a certain performance index. The order of the 
hierarchy is the distinguishing factor betv/een leaming and adaptive 
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systems* Control systems using information contained in their past 
input y output and envlxonmental feedhaclc signals which improve their 
ovm performance are termed adaptive* A self -organizing or learning 
control system is consequently one which demonstrates both, adaptive 
and memory capabilities* A learning system has the capability of 
changing its basic structure (including the adapting logic and the per- 
formance index) as a fmction of its experience and/or its environment* 

Such a system is usually eicpected to learn the solution to a control 
problem on-line, a process required because of, and despite, incomplete 
a priori knov/ ledge of the plant and its environment* Theoretically, 
a learning system is also capable of overcoming failures of the adap- 
tive loop* 

There are many variations of learning concepts which have been 
proposed, not all of them agreeing with the other* Many have been 
labeled learning v/hen they do not conform to this def .inition and are 
simply adaptive control systems*. 

In the field of learning systems for automatic control, there is 
not any one single theory v/hich forms the base-line* A mixture of the- 
ories and techniques must be used* 

The most prominent mathmatiqal techniques to formulate a learning 

system are described below: 

(TQ 19 32) 

1) Decision Theory * ’ ’ .In this method one tries to specify a de- 
cision surface so as to minimize the probability of error. The system is 
called a pattern classifier and the simplest one is the linear case where 

thei decision surface is a hyperplane. A modification of this algorithm 

(32) 

has been suggested that tries to arrive at a piecewise linear decision 
surface from a given set of classified feature vectors. This modified 
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method is called the trainable controller method. It quantizes the state 
space to elementary hypercubes in which control action is assumed con- 
stant. During the training process the trainable controller makes 
changes in its weight based on the training pattern being "shown” to it, 

together with the output of that pattern. This is done until the number 

(79) 

of classification errors has reached a steady state value. It is shown 
that this method can approximate a decision surface to an arbitrary de- 
gree of accuracy (by increasing the number of quantum zones). 

2) Markov Chain Theory ^ . This method provides an approach to 

modelling the dynamics of learning controllers v This method is also 

(64) 

called learning with reinforcement even though this is a more general 
concept. By learning with reinforcement is meant the process where there 
is a continuous scale of rev;ards with strict reward and strict pmishment 
representing the upper and lower bovands respectively of this scale. The 
engineering system is then designed to extremize reward. This method is 
used when there are ±n distinct control actions and the system chooses 
one of them by a specific probability* The effect of learning in this 
system can be viewed as an iteration of probabilities; hence, from one 
trial to the next a transition in the probabilities occurs. In general, 
it can be proved that the probability of correct action v/ill converge to 
its maximum in the mean^^^^ . 

( 1 33 ) 

3) Bayesian Learning In Control Systems ’ . This is an on-line ap- 

proach to improve the performance of a linear stochastic system by reduc- 
ing the iuicertainties about the plant parameters belonging to a finite 
set; if not, discretization of the parameter space should be done, which 
is a serious shortcoming of the method. A class of estimators and con- 
trollers is formed among which the stochastically optimal combination is 
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a member. Baye's rule is then applied to reduce the uncertainty of the 
unknown parameters by sequentially computing their respective a posteri- 
ori probabilities. The learning feature of the a posteriori probabili- 
ties is utilized in order to select the feedback controller among the 
possible combinations available. This algorithm is designed for linear 
Gaussian systems with quadratic performance criteria and unknown system 
parameters belonging to a finite set. 

( 84 85) 

4) The Stochastic ADproxlmation Method ^ ’ . This method has been 

introduced by Tsypkin. The method utilises a procedure where the un- 
known system dynamics as well as the control action are modeled by lin- 
early independent functions weighted by adjustable coefficients. The 
learning scheme consists of updating the weighting coefficients to match 
the approximate combination of stochastic and optimal control. This 
method requires the evaluation of sensitivity matrices which, in the case 
of a linear plant, depend on the unknown system parameters. A difficulty 
arises with the choice of appropriate matrix functions which should span 
the corresponding space. This method is also called ”hill climbing”, 

since one perturbs the adjustable parameters in the direction of increas- 

] 

ing performance index. 

Having examined adaptive and learning control systems, in the fol- 
lowing sections we will discuss the advancements of flight control systems. 

2.4. Analog and Digital Flight Control Systems 

From a meager start in sensors, the digital revolution has spread 
through avionics until it has now reached the central core of the auto- 
matic flight control system. The realignment of Industry from analog 
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systems to digital systems is based on expectations that a number of 
problems associated with existing analog systems will be eliminated both 
for the designer and the user . 

Many characteristics attributed to digital systems are said to 
solve the analog problems. Digital automatic flight control systems are 
a reality and, while they may not yet be perfect, they offer substantial 
advantages in capability, reliability and cost effectiveness over another 
generation of analog systems. 

In terms of capability, the functional requirements for automatic 
flight control systems have been a joint function of basic electronic 
t^hnology, packaging and the evolution of commercial transports. An 
automatic flight control system (APCS) that can provide automatic control 
virtually from lift-off is basic to the aircraft. In conjunction v/ith 
sophisticated navigation and sensor subsystems, the pilot is able to en- 
gage the AFCS shortly after take-off, climb to cruise altitude, fly along 
a predetermined route, and automatically land in near zero visibility 
conditions at the end of the flight by doing little more than pressing 
the correct buttons at the appropriate time. 

The continuing trend is toward each generation of APCS being more 
complex than its predecessor. The interface requirements have arisen 
from the number of sensors, actuators and logic data required to support 
the AFCS computer in accomplishing the various pilot-selected modes. 

The large amount of interface connections is significant to the user, 
since each connection represents a point where malfunctions or misfunc- 
tions can occur. 

A factor that influences analog system capability and maintain- 
ability is the nature of the analog computer and its dedicated circuitry. 


For each branch of the system control laws, a separate computational 
path comprising dedicated circuitry must be utilized. This has led to 
much difficulty in identifying and isolating system failures so that 
appropriate maintenance activity can be conducted. To alleviate these 
difficulties, research has been directed towards digital control sys- 
tems. 

Numerous studies in the past several years have been conducted 
to determine the applicability of digital computers to automatic flight 
control. The benefits most commonly attributed to digital flight con- 
trols can be categorized as improvements in comfxitational accuracy, de- 
sign flKcibility, improved cost and reliability integration of functions, 
and Improved capability for self -testing. 

A digital computer is not inherently an accurate computer. There 
is a finite resolution to each calculation made that is a function of 
the scaling and digital word length. (Analog resolution is theoretical 
infinite). The digital computer's primary characteristic is that the 
calculations are repeatable. Two computers given the same input, v;ill 
provide precisely the same output because the computers are not subject 
to internal analog errors such as gain tolerances or shifting null volt- 
ages. Repeatability of calculation is a distinct advantage in achieving 
virtually perfect mutual comparison or tracking between the outputs of 
computers operating in parallel-redundant systems. 

A major aspect of digital computer flexibility is the commonality 
of the processor with many different applications. The processor non- 
recurring costs are then spread out over several different systems. This 
supposition is felt to be in particular contrast to the e:cperience with 
analog systems where it is unusual for a system designed initially for 





I 


ii- 


I 

i 



I 


I 

I ; 


19 

Installation on one airplane type to be used for a subsequent airplane 
model. The prime motivation factor, however, for changing auto-pilots 
from one model to the next is the changing electronic technology and 
not the inability of simple adaptation of one system to all the models. 

One of the major advantages of a digital computer is the capa- 
bility to time share the computational equipment. That is, unlike the 
analog system in which computational elements are mainly dedicated to 
one task, a digital system can use the same equipment to perform sever- 
al unrelated computations v/ithin a given time frame. Failure to apply 
this capability will impair the cost-effectiveness of a digital flight 
control system. 

The potential for self -test within a digital flight control sys- 
tem is substantially higher than for a similar analog system. The main 
problem in the analog case is that the built-in test equipment (BITE) 
requires dedicated circuits and directly affects complexity, reliability 
and cost. Such is not the case with a digital system. Self -test does 
require additional program to implement but no other dedicated circuitry 
need be required. The next section presents a brief overview of the P-8 
Digital Fly-By-VJire (DFBW) study. 

2 .5 ^ The F-8 Digital Fly-By -Vf ire (DFE^>J) Program 

The broad objective of the NASA P-8 Digital Ply-By-VJire program is 
to provide the technology required for implementation of advanced, reli- 
able, digital fly -by -wire (DPBVJ) flight control systems which will permit 
greater operational capability and increased performance of future air- 
craft. The program makes use of an F-8C naval fighter test aircraft which 


1 
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has been modified by removal of the mechanical flight control system and 
its replacement with an electronic flight control system* 

The program has been conducted in two phases. Phase I explored 
pilot acceptability and technical feasibility of digital fly-by-wire 
using a single channel digital system constructed from components de- 
veloped previously for the Apollo Space Program. Phase II objectives 
include establishing a design base for practical multiple channel DFBV/ 
systems using a triplex digital system, to flight test the system and 
certain selected space shuttle flight control system concepts and to 
conduct research into and evaluate advanced control law concepts suit- 
able for digital implementation. 

Research to investigate and promote advanced control laws for 
possible flight experimentation has been motivated by the greater flexi- 
bility and logic capability of digital systems as compared to analog 
systems and by the increased complexity and sophistication expected of 
future aircraft flight control systems. 

There are two major types of adaptive control. One type, some-' 
times called analytical redundancy, has to do with adaptation to fail- 
ures in control system components internal to the aircraft such as 
sensors and actuators. The idea is to take advantage of the kinematic 
or dynamic relationships which exist between the sensors and/or actu- 
ators of a moving physical system such as an aircraft to complement or 

( 11 , 30 ) 

reduce the hardware redmdancy needs of the physical system . 

The second type of adaptive control has to do with the adaptation 
to the changing external environment of the aircraft, such as changes 
in dynamic pressure, mach number and altitude, etc. 
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These adaptive systems have not gained wide acceptance peurtly 

due to the success of gain scheduling technology and also due to the 

existence of both theoretical and practical problems that are still 

unresolved in the field of adaptive control* ttost important is the 

problem of guaranteeing stability of the adaptive and control loops of 

(42 59 ) 

the systen under operation' ’ . Another practical problem is the 

frequent need for a dithering signal to excite the system during peri- 
ods of control inactivity. The signal must be subliminal to the pilot 
and crew. The need to limit the amplitude of this signal usually re- 
quires that the adaptive loops operate with un favorable signal -to-noise 
ratio. 

The problems mentioned above indicate that a requirement needs 
to be imposed on any adaptive system regarding when to allow gain ad- 
justments to be made. The system should allow gain adjustments to be 
made only when the adaptive system passed certain tests. This restric- 
tion, however, may preclude rapid gain adjustments and if such adjust- 
ments are required, only a gain scheduling system would be able to 
handle the problem. Seemingly, the next advance in flight control sys- 
tems should take the best of the two worlds of gain scheduling and adap- 
tive control and eliminate the undesirable features of each. This dis- 
sertation describes a system intended to accomplish that objective. The 

( 55 ) 

described system is called a learning control system . It is, in 
fact, an adjustable blend of a gain scheduling system and an adaptive 
control system. 

Having considered background information on adaptive learning con- 
trol systems and flight control systems, our Learning Control System (LCS) 
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will be developed in general terms in the following chapter. We will 
then consider the P-8C aircraft to be the plant and apply our IXIS to 
control its dynamics in chapters 4 and 5« 
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CHAPTER 3 

THE LEARNING CONTROL SYSTEM 

3.1. Introduction 

The basic problem to be considered in this chapter is that of de- 
siring a Learning Control System (LCS) that is capable to meet design 
retirements over many possible operating conditions of the plant by 
adjusting, in a prescribed manner, the feedforward and feedback gains of 
the plant. Figure 3.1 shows the functional organisation of the learning 
control system that consists of three basic subsystems: 1. The Information 
Acquisition Subsystem (IAS), 2. The Learning Algorithm Subsystem (LAS) and 
3. The Memory and Control Process Subsystem (MCPS) . Tv-;o techniques are 
described for developing the IAS. The first technique makes use of Liapunov's 
direct method and the second technique is based on Newton -Raphson method. 

Each IAS may be combined with the LAS and MCPS to form a LCS. The LAS is 
developed using a sequaitial coefficient estimator technique which is based 
on iterative least-square algorithms. The MCPS consist of the collection 
of updated coefficients which are used to compute the feedforward and feed- 
back gains via a set of control laws . In the next sections we will describe 
general mathematical techniques used to develop the above mentioned subsystems. 
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3«2« The Information Acquisition Subsystem (IAS), Liapunov’s Direct Wsthod 


The basic problem to be considered in this section is that of design- 

j 

ihg an adaptive algorithm that will result in a stable Information Acquisi- 
tion Subsystem (IAS). Liapunov's stability theorems offer possible solutions 
to this design problem, the particular solution depending in part upon the 
form of the Liapunov function selected. 

Consider the Information Acquisition Subsystem (IAS) of 
Pig. 3.2 where the linear multi -input multi-output plant with constant or 
slowly time varying parameters is described by the system of differential 
equations 


y > Ap(p)y + Bp(p)<r^ 


(3.2.1) 


where y is the n r dimensional state vector and S' is the r - dimensional 
input vector. Matrices A^(p) and Bp(p) describe the dynsimic characteristics 
of the plant. Let S ^ si linear combination of the states and the control 


input given by 


- Ky + G cTg 


(3.2.2) 


where S is the control input and G and K are the feedforward and feedback 
s 

gain matrices of the plant respectively. Substituting Eg. (3.2.1) yields 
y ■ ^ (3.2.3) 

Let the plant described by Eq. (3.2.3) be represented by a model of 
the form 

z = Fz + Fa (p) - f 1 y + B (p)<5* (3.2.4) 

Im J m s 

wKere F is a stable matrix, z denotes the n - dimensional model state vector 
end ®m^P^ describe the model closed loop dynamic characteristics 
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as a functicn of parameter (p) . Note that (p) and matrices A (p) and 

m 

B^(p) must satisfy the following conditions: 

A (p . p) - rA„(p) + B^(p)k 1 - A*(p) (3.2.5) 

™ L P P J P 

B (p - p) - B (p)G « b’(p) (3.2.6) 

m p p 

The dimensionality of the model is assumed to be the same as that of the 
plant, het the model-plant state error be defined as 

e » z - y (3.2.7) 


The model-plant error differential equation yields 

e . F« + 

t:i Lsi 

where 

A. 


LSI 


(3.2.8) 


(3.2.9) 


(3.2.10) 


Vectors b^ and d^ are constant for all i and u^ and w^ are vectors whose 
elements are the misalignments of parameters (p) and (p) of the i-th row. 

In the Liapunov approach a Liapunov function V is chosen which is 
a quadratic form of the system error e and the parameter misalignments 
and w^. Let 

V a e^Pe + ^ ■*“ 

i.=i 


n 


''iVi 




(3.2.11) 
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where P, and are symmetric positive matrices with constant elements. 
The time derivative of the Liapunov function yields 


V - e^(P^P + PF)e + 2^ + y'’^(b^Pe)j UjL + 


^ 2 




fcsl 


(3.2.12) 


The matrix P can be found as the unique solution of 


P P + PP =. - Q 


(27) 


(3.2.13) 


where Q is any n x n symmetric positive definite matrix 

Applying Liapunov* s stability criterion V > 0 and V ^ 0 and 
Integrating yields 


w 

T / T T •! ' 

u^ * - I y '(b^Pe)N^ at + u 

o 

t 

w? a - r ^^(d'FPe)QT^dt + w? 
X J s i X 


T 

io 


T 

io 


(3.2.14) 


(3.2.15) 


Substituting £qs. (3.2.14) and (3.2.15) into Eqs, (3.2.9) and (3.2.10) 
and rearranging the resulting equations one obtains 

t 


y =■ -2 f 


t 

- VPo> 

c*t •'o 


where 


n. 

- ^p'P’ - 


T 

io 


(3.2.16) 


(3.2.17) 


(3.2.16a) 


LSI 
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n. 

VPo> - VP» -ZvL 

A 

Note that is an initial estimate of vector p and u. , w. are the 
*^o ^ io io 

initial misalignments at t ■ 0. After this initial choice, we proceed 
to solve Eqs. (3.2.16) and (3.2.17) to obtain vector p. 

Substituting Eqs. (3.2.13), (3.2.14) and (3.2.15) into Eq. (3.2.12) 

yields 

V « - e^Qe (3.2.18) 

The form of V is negative definite with respect to e because Q is positive 
definite. Therefore, the error differential Eq. (3.2.8) is asymptotically 

j 

sliabla in the v;hole for e which means that after an initial disturbance 
the Euclidean norm {[ejj — ► 0 for t— . Since V is indefinite with 
respect to u^ and w^, the parameter misalignments do not necessarily go to 
zero, which means that the identification is not exact. To overcome this 
difficulty, another input is applied to the plant, if possible, after the 
model-plcint state error has reached the steady state. Under certain cir- 
cumstances , there may be a need to apply several inputs in order to obtain 
a better identification. 

As we pointed out earlier, the adaptive algorithm may vary according 
to the Liapunov function selected. To illustrate this, let us choose a 
somewhat different Liapunov function 

V ■ e*^Pe + ^ + 

(3.2.19) 

. X [“i " ''‘1 

LSI 



^ IT 
>^Pe)y 


"i[“i * 


The time derivative of the Liapunov function follov/ing from Eg. (3.2.19) is 


n. 


V - e^(p'^P + PP)e + 2 ^ |u^ + d/dt j(b^Pe)^l'^. + (b^Pe)^ 


tsl 


fV 




t=l 


n- 


a 


2 ^ y^(b^Pe)u^ + 2^,rg (d^Pe)w^. (3.2.20) 


t,S-t 


t=l 


Again, applying Liapunov's stability criterion V > 0 and V £ 0 yields 


t 

a -J' y^(bTpe)M~^dt - y*^(b^] 


T 

Pe) + u. 

xo 


(3.2.21) 


i: 

a - J ,f^(d^Pe)Q^^dt - (T^(d^Pe) + 


io 


(3.2.22) 


The adaptive algorithm equations resulting by substituting Eqs. (3.2.21) and 
(3.2.22) into Eqs. (3.2.9) and (3.2.10) are different from the previously 
obtained Eqs. (3.2.16) and (3.2.17) because of the different choice of the 
Liapunov's function and are given below by 




A (p ) - 
m *^o 



biy'^(bJ?e)N“^dt b^y'^(b^Pe) (3.2.23) 




L:t 




B (p) - B (p ) - 5 " - y d.vJ’^Cd'J'pe) (3.2.24) 

m mo Z-.. I X s 1 x x . x s x 
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Substitution of Eqs. (3.2.21) and (3.2.22) into Eq. (3.2.20) yields 

V - - e^Qe - 2 ^ y‘^(bjpe)^y - 2 (3.2.25) 

Lsi t,-i 


Since Eq. (3.2.25) contains more negative terms than Eq. (3.2.18) the 

adaptive algorithm defined by Eqs. (3.2.23) and (3.2.24) results In an 

( 88 ) 

accelerated error convergence even though V is still negative definite 
in error e only. Note that Eqs. (3.2.23) and (3.2.24) are solvcxJ in a 
similar manner as Eqs;. (3.2.15) and (3.2.17). 

The performance of the IAS is monitored by a convergence criterion 
defined by 

V(t)/V(0)< (3.2.26) 


where ^ is prescribed by the designer according to the desired accuracy 
beto'/een vectors p and p for a prescribed length of time v;hile the system is 
sul^ject to sufficient excitation. Since the plant parameters are xinknovin 

I 

we have no knowledge of u^ and which means that we cannot calculate the 
Liapunov function V( t) and consequently the convergence criterion ^ . To 
overcome this difficulty let us rewrite Eq. (3.2.26) as 

■fc 

/V|^- V(t)/V(0) » 1 + j”v(t) - V(0)^ /V(0) - 1 + /"v(t)dt/v(0) (3.2.27) 


The Liapunov function may be viewed as being composed of three parts, i.e. 


V1+V2 ..V3 


(3.2.23) 


consider Eq. (3.2.11) and let 
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T 

m e Pe 



n, 


2 


Ls-I 


T 

Z,Wi 


til 


(3.2.29) 


If matrices N. and Q. (are chosen such that 
r i 


P ^ and P ^ (for every 1) 


(3.2.30) 


then V(0) may be approximated by 

Tnv 

V(0)= V,(T ) - I V(t)dt 
X m I 

O' 


(3.2.31) 


where T is the time at which V, reaches its maxiittum value, V 
m i 

tuting Eq. (3.2.31) into Sq. (3-2.27) yields 


Imax* 


Substi- 


'a 


a 1 


[''to -J 


(3.2.32) 


which is the approximation of Eq. (3.2.26) used to calculate the convergence 
criterion • For the adaptive algorithm given by Eqs. (3.2.23) and 
(3.2.24) the same approximation, Eq. (3.2.32), may be used to calculate the 

m 

convergence criterion 4^, but in this case V(t) should be used as indicated 
by Eq. (3.2.25). 

VJhen the output measurements of the plant are corrupted with noise, 
this noise appears both in the plant output and in the error signal. Because 

ORiGiNAn Page I& 
OP POOR QUALITY 


of the multiplication in the adaptive law Eq* (3*2.16), the computed p^ 
will be biased • To overcome this, one may choose a model of the form 

a » A (p)z + B (p) r (3.2.33) 

m m . w s 

leading to an error differential equation of the form 

n n 

e » Ap(p)e + ^i^i^ y ^i^i^ (3.2.34) 

A similar type of< development as before, will produce the follov/ing 
adaptive algorithm equation 

(3.2.35) 

which is to be used instead of Eq. (3.2.16) in the case of small signal 
to noise ratio- In this case the elements p^ will not be biased since one 
uses the outputs of the model, denoted by a, vjhich are not directly af- 
fected by noise, while in Eq. (3.2.16) one directly uses the noise obscured 
measurements of the plant to be identified. 

I Parameters p^ after they have passed the convergence criterion, 

defined by Eq. (3.2.26), are packed into a 1 - dimensional parameter vector 
p which will be used as an input to the Learning Algorithm Subsystem (LAS) * 
The next section describes an alternate approach to develop the IAS, 
namely the Ne;irton-Raphson method. 


• Mote that p^^ are the elements of matrix A^(p) 




A (p ) 
m o 


-t f V 

L-l-i 


T T -1 
(b^Pe)N^ dt 
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3.3. The Information Acquisition Subsystem (IAS), Newton-Raphson Method 


In this section the adaptive algorithm of the (IAS) will be designed 
by using a gradient -based nonlinear programming method that can be used for 
computing maximum likelihood estimates. Let the plant described by Eq. 
(3.2.3) be represented by a model . 


X = A (p)x + B 
m m 


(p)/. 


(3.3.1) 


vdiere x is the n - dimensional model state vector and all other variables 
are as defined before. The dimensionality of the model is assumed to be 

A 

the same as that of the plant. Note that p is an estimate of p and matrices 

A A 

A (p) and B (p) must satisfy the conditions given by Eqs. (3.2.5) and (3.2.6). 
mm 

Let the model-plant error be defined as 

e = y - X (3.3.2) 

In the parameter estimation problems, when the maximum likelihood method 

is used, it is usually more convenient to v;ork with the negative of the 

logarithm of the likelihood function. It is possible to do so because the 

(51) 

logarithm, is a mono tonic function. It can be shown that the negative 
log-likelihood function J(p) is 


N 


A 

J(P) 




LSI 


T -1 

e.B.e. + log B. 
i a. i » i 


(3.3.3) 


where B. is an n x n matrix whose inverse is the covariance matrix of the 
plant’s state, that is 

b]*^ - E |^(yj^ - (3.3.4) 

and N is a fixed number which denotes the amount of data that need to be 
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collected before the minimization process can begin. Note that is 
the expected value of • 

It is important to have good starting values of the parameters 
since this considerably improves the probability of convergence and of 

locating the absolute maximum of the likelihood ftjnction. It is also 

.{ 

useful to determine if an appropriate model is being used etnd all param- 
eters are identifiable from the data* 

The basic iteration of gradient-type methods have the form 




(3.3.5) 


where is the parameter vector at the j-th iteration, f^ is a vector 
of gradients of the negative log-likelihood function J(p), i.e*, 




(3.3.6) 


and is an approximation to the second partial matrJbc i.e., 


[ 


3^J(P)/2)P^] 


A A 

p “ p^ 


(3.3.7) 


Note that ^ is a scalar step size parameter chosen to ensure that 

J(p^^^) < J(pJ - 6 , where 6 is a positive number that can be chosen 
, (25) 

in a variety of ways . 

The class of the gradient-type methods differ mainly in the 

(48) 

selection of , and in some cases and f ^ . It is shown that the 
convergence rate near the minimum for algorithm (3*3.5) with ^ ^ chosen 
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by a one - dimensional search is 


i ‘/‘max - 


Where ^ and ^ are the maximum and minimum eigenvalues of 
2 a2 

Sj(3 J/9 p ). It is clear from (3.3.8) that the best convergence is 

2 a2 

achieved by making as nearly as possible equal toY^ J/9p )” • Ih 
developing our IAS we use Nevrt:on-Raphson technique as a member of the 
gradient -type methods. 


For the Nevrton-Raphson technique is chosen as 


< P^) 


-1 


A A and (3 . » I except when this choice of Q . 
P » -Pj J j ' * J 3 


gives an increase in cost. VJhen this method converges, the convergence 
is quadratic. However, the method has the following drawbacks: a) It 

2 A 2 

fails to converge to the desired optimum whenever O has some 

2 2 

negative eigenvalues, b) If O J/3p ) is nearly singular, there are 
numerical problems in inverting it. This may result in slow or no con- 

2 a2 

vergance at all. c) Generally, the computation of (9 J/9p ) is very 

expensive. This is the main reason that we will consider various modi- 

1 ■ 

f ications of this algorithm by doing some approximations in order to 
save on comptjtation load. 

The first modification will be to get an approximation of 
2 2 • 

(9 J/3p ) rather than computing its exact value. Differentiating Bq. 
(3.3.3} with respect to p we get (assuming constant) , 


f. ■ "9 J/9 p I A A » 
J I P * Pj 








(3.3.9) 
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where ^x/^'p is the sensitivity matrix of dimension n x ■£ ( ^ is the 
mmber of unknown parameters) whose form is 


3 x/3p « 



3 Xj^/3p2 

... 9 x^/d p^ 

3 Pj^ 

d Xg/S Pg 

... 

• 

• 

• 

• ' . 

• 

• ' ' 

? Vap, 


... 


(3.3.10) 


11 ) 


Differentiating Eq. (3.3.9) x\rith respect to p we get 


U 

S~^ » [2^ ^ p)'^B^O x/3 P>il j = ( ^ ^J/3 p^) j J ^ A . (3.3 . 

t,S| 

Then substituting Eqs. (3.3.11) and (3.3.9) into Eq. (3.3.5) yields 


N f/ V 

p.^^ “ Pj + [^Ox/3p)^B^Ox/3p).y^ j (3.3.12) 

is, i-l j 


where O x/3 p), the sensitivity matrix, can be obtained by differen- 
tiating Eq. (3.3.1) with respect to p and interchanging 3/3p with d/dt. 
This yields 


d/dtOx/3p) » A (p)(3x/3p) +3/3p Fa (p)x + B (p)<rj] 
in ■ ■■ I m in s i 


(3.3.13) 


X constant 


Since the initial conditions of the model, x(0) , can be selected inde- 

A 

pendently of p^, the initial condition for the sensitivity matrix 
('c^x/Dp) is set to zero, (0). Hence, Ox/3p) can be obtained fay solving 
the set of differential equations given by Eq. (3.3.13). Since sj^ is 
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non-negative definite, one can always find ^ ^ j such that 

^ JCpj). The method, however, runs into the problem when 

i 

is singular or nearly singular, that when inverting we may obtain 
an indefinite S^, so that ^ JCPj) all ^ j ^ these 

cases it is better to obtain the parameter step hy solving the follow- 
ing linear equations 


- 1 .'' » 

^3 


- f 


i 


(3.3 


Several schemes 


( 25 ) 


are available for selecting 


?3- 


Since the 


calculation of J(p) is very expensive^ , methods which require several 
trial values during each iteration are not desirable. A simple 

procedure is to use « 1 in those cases v/here 




(3.3 


If condition (3.3.15) is not satisfied, J(p.), f., J(p. ,) and f. , 

are used to obtain a quadratic fit in and the stationary value 
^ ^ that minimizes the quadratic fit is computed using j? ^ f * 
The convergence criterion for this IAS is satisfied when the 
follov/ing conditions are met: a) The cost function J(p^^j^) is below 

some quantity ^ fixed by the designer, i.e. 




(3.3 


b) The vector close to its previous iterate, p^ in the Euclidean 


norm sense I.e* 


A A 

' j+1 ' 


P4.T - P.|| ^ ^ 2 


( 3.3 


. 14 ) 


. 15 ) 


. 16 ) 


. 17 ) 
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where 62 ^® determined by the designer, c) As we shall show in the 
following section (3.4) is the covariance matrix of Pj^^* Hence, 

its i-th diagonal element is checked to be smaller than the allowable 

A 

covariance of the parameter i.e. 

cov Pji^ (3.3.18) 

where cov p^ is determined by the designer and is the maximum tolerable 
covariance of the coefficient p^. 

Parameters p^ after they have passed the convergence criterion 
are packed into a parameter vector p of dimension 6 x 1 to be used in con- 
junction with the Learning Algorithm Subsystem (LAS) • 
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3»4« The Leamlnqf Algorithm Subsystem (LAS) 


The values of parameter vector p affecting the dynamics of the 
plant are fitted to predetermined analytical expressions which describe 

A 

the behavior of p over a range of the plant's operating conditions, de- 


noted by vector h • Assume the relationship between the parameter vector 
A 


p and the plant's operating condition vector h be described by an unknown 
memoryless system 

p » f(h) 

where X is the uncertainty vector of p with properties E(X) » 0 and the 
covariance matrix 


(3.4.1) 




E < 


v/here E(p) = f(h) 

A model is assumed of the form 


fp - E{p) .^p - E(p)J ^ 


(3.4.2) 


f(h) = H(h)C 


(3.4.3) 


where H(h) is the operating condition matrix whose elements in any row 
are linearly independent functions of h spanning the space f(h) . The values 
of p and H(h) to be used in the evaluation of the coefficient vector C are 
received sequentially from the information acquisition subsystem. Hence, 
an iterative form of the least-square method is appropriate for obtaining 
the coefficient vector C each time new information (p, h) is received from 
the IAS. The problem is to select an estimate of C, denoted by Cj^, which 


minimizes the quadratic criterion 0^ given by 


"ic 'X - “‘VSc] [Pl - 


(3.4.4) 


L=i 
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The subscript on Cj^ is used to emphasize that we are using k vector obser- 
vations to compute the estimate. At present, the subscript is superfluous, 
tut it will be helpful when we develop the sequential format in the sequel. 
Because the minimization of 0^ is an ordinary deterministic minimization 
problem, the least-squares estimate is obtained by setting 




(3.4.5) 


Differentiating Eq. (3.4.4) v/ith respect to yields 

k 

3 V2 =1= =■ - [?i - » 0 


(3.4.6) 


Solving Eq. (3.4.6) for we obtain 


Lsl 


(3.4.7) 


Note that the solution requires the inversion of the matrix 
k 

yn (h^)R^ H(h J . The existence of this inverse is essentially an 
<•-/ ( 69 ) 

observability requirement . For notational convenience, let us define 


P, as 
k 


i=l 


-1 


(3.4.8) 


In view of Eq. (3.4.8), Eq. (3.4.7) becomes 


Sc - "k 

is/ 


(3.4.9) 
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It is desired to establish an iterative formula to simplify the computations 


involved in Eqs. (3.4.7)and 0.4.9). To do so, suppose the data 


is available, then will be 


kti 




(3.4.10) 


t=» 


where 


kfi 


k+1 




-1 


(3.4.11) 




Taking the inverse of Eq. (3.4.11) yields 


K+» 




LSI 


LSI 


mvertdng Eq. (3.4.8) and then substituting into Eq. (3.4.12) yields 




(3.4.13) 


-1 


Since matrices ^^^+1^’ \+l (3.4.13) and ^k+1* 


P”^, nonsingular and H(hj^^j^) is of maximum rank, then is 


given by 


(69) 




p,, 1 » P^ - 

k+1 k k 


(3.4.14) 


Equation (3.4.10) may also be written in the form 


rx't? 
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k 

i=i 

Substituting Eq« (3*4*14) into Eq. (3.4.15) and rearranging terms yields . 
the iterative equation for 

S.X - V \»l] [P)C*1 - 

(3.4.16) 


The initial values of vector may be selected as the coefficients of 
the interpolation polynomials over some apriori available data about the 
variation of parameters over some range of admissible operating con- 

dition h. This is done .In order to start with a finite tncertainty about 
the coefficients C which l,s a necessary condition to guarantee stability 
of Eqs. (3.4.14) and (3.4.16) . The matrix may be chosen initially as 

any positive definite matr.ix, and this choice influences considerably 

(71) 

the rate of convergence and stability of the algorithm . 

A 

To provide an estimate of the covariance matrix R of vector p let 
? - E (0) r - Pj (3.4.17) 

Substituting Eq. (3.3.12) into Eq. (3.4.2) and using the approximation 
given by Eq. (3.4.17) yields 

V • r ^ ^ 

R a |“^( 5 x/^ p)^B(DX/ 5 pO ^ 5 pO 

[^(^x/3p)^BOx/3p)^J ' (3.4.18) 

til 
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where the bar denotes expectation. Letting 


T 


B 




13c 


(3.4.19) 


where 


S: 


Uc 


0 for 1 3c 

1 for 1 «» 3c 


(3.4.20) 


the covariance matrix R of vector p reduces to 


N 

R 3 |^^( 3 x/9 p)T B O x/ 5 p) ^ 
L~i 


-1 


(3.4.21) 


Equations (3.4.14), (3.4.16) and (3.4.21) constitute the i:^aming Algo- 
rithm Subsystem (LAS) . 

To obtain the goodness of the least-squares estimate let us 
define the estimation error as 


■1== =-=lc 


Substitution of Eq. (3.4.9) into Eq. (3.4.22) yields 


(3.4.22) 


since 


c-t 

p^ a H(h^)c + 


(3.4.23) 


(3.4.24) 


Therefore, substituting Bq. (3.4.24) into Eq. (3.4.23) gives 

k 

^ » C - H^(h^)R^^ ^H(hjj)C (3.4.25) 
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In view of Eq« (3«4.8) 

k 

^ (3.4.26) 

c=i 

Taking the expectation of Eq. (3.4.26) we obtain the expected value of 

/-W 

k 

E(^) » - Pj^ 2 H^(h^)R2^Aj^ » 0 (3.4.27) 

6zl 


Hence, Cj^ is an unbiased estimator. The covariance of the least -squares 
estimation error is given by 


k Ic 


(Cov)^ - [J. VK'”"'3’] '’k 


(3.4.28) 


€-Sl /i( 


In view of Eq. (3.4.2), Eq. (3.4.28) becomes 


K 

(CV)^ - '■k [XH’'(h^)R-lH(hj^)] . P, 


(3.4.29) 


A necessary test for the LAS must be to continually evaluate the 
validity of the information in vector Cj^. This test is performed by the 
confidence criterion which is satisfied when the following conditions 
are met: a) The mean square error is below some quantity £ fixed by 

the designer i.e. 

< (k + 1) £ 1 (3.4.30) 
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b) The vector is close to its previous iterate Cj^ in the Euclidean 
norm sense i.e* 


IIS»l-=k|l 


(3.4.31) 


where again £ 2 i® ® number determined by the designer and depends upon 
the desired accuracy, c) The i-th diagonal element of matrix Pj^ is 
smaller than the allowable covariance of the coefficient C^. Since 


<Vii = E - =] 


(3.4.32) 


then condition (c) reduces to 




(3.4.33) 


where the quantity cov is fixed by the designer and is the maximum 
tolerable covariance of the coefficient C^. 

Note that used inEq. (3.4.30) is given iteratively by 

(3.4.34) 

where 0^ is defined by Eq. (3.4.4). 


IppMmSMM 
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3.5» The Kemory and Control Process Subsystem (NCPS) 

The plant's dynamic characteristics vary over differ^t opera- 
ting conditions h, therefore, the overall objective of the learning 
control system is to maintain uniform dynamic quality parameters over 
all possible operating conditions insofar as possible. That objective 
might be compromised because the feedback gains required to maintain 
uniform dynamic quality characteristics may be excessive. Excessive 
feedback gains can excite instabilities caused by nonlinearities of the 
plant • 

In order to alleviate this possibility, the memory stores the 
values of vector Cj^ after they have passed the confidence criterion 
test. The control process subsystem, v/hich consists of control laws, 
then conpites the corrections Z\g and /\ K needed for the feedforward 
and feedback gain matrices G and K respectively using the stored val- 
ues of vector C^. In order to design the control lav;s for maintaining 
uniform dynamic quality parameters a design criterion is selected that 
yields a relation between the feedfon-;ard and feedback gains as a 
function of the parameter vector p and hence a function of the coeffi- 
cient vector In summary, the control law is designed to meet a 

design criterion over a range of operating conditions to yield an oper- 
ationally feasible control system. 

Imposing the constraint that the plant's handling qualities 
remain the same over all possible flight conditions we get from Bqs. 
(3.2«5} and (3.2.6) 

A (p) + B (p)K a constant 
P P 

Bp(p)G « constant 


(3.5.1) 





46 


Applying the difference operator on both sides of Eq. (3.5.1) gives 


A Ap(p) + [ABp(p) j K + [Sp(p)j Ak - 0 
[a Bp(p) J G + [^Ep(p)j A G a 0 


The solution of Eq. (3.5.2) for A ^ Ag yields relationships 
which constitute the control laws given by 




v*ere i = 1,2, ,n and 

Ag > - |B^(p)Bp(p)] B^(p) 

Note that superscript i on the matrix A A (p) denotes the i-th 

P 

column vector matrix A A (p). Equations (3.5.3) and (3.5.4) readily 

P 

apply as the control laws for the longitudinal dynamics Ixit not for 
the lateral dynamics since in the latter case the two inputs to the 
plant are not linear combinations of all the states. The lateral 
control lav/s will be developed in sections 4.5 and 5.4 accroding to 
the specific application. 

The next chapter presents the application of the LCS to a high 
order representation of the F-8 DFBl*/ aircraft dynamics. 


,(p)' 


(3.5.4) 
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CHAPTER 4 

CASE STUDY I| THE APPLICATION OP THE LCS 
TO A HIGH ORDER REPRESENTATION OF THE P-8 DPBW AIRCRAPT DYNAMICS 

4 » 1 . Introduct ion 

To describe the behavior of an aircraft, one must study the non- 
linear equations of motion of the aircraft# A solution for these equa- 
tions may be obtained by the use of analog or digital computers or by 
manual numerical integration# In most cases, however, by the use of 
proper assumptions, the nonlinear equations can be decoupled into txvo 
sets of equations describing the longitudinal and lateral dynamics sepa- 
rately# In order to obtain the two sets of equations, one may consider 
the aircraft to be in straight and level unaccelerated flight and then 
to be disturbed either by deflection of the elevator or by the aileron 
and rudder# The elevator deflection applies a pitching moment (longitu- 
dinal) but does not cause a rolling or yawing moment ( lateral) . On the 
other hand, a rolling and a yawing moment generated by the aileron and 
rudder disturbance excites angular velocities about all three axes; thus, 
for certain cases the equation cannot be decoupled. The pitching moment 
that is generated from a lateral motion results from second order terms 
of yaw rate and roll rate thus, if the perturbations are small, and if 

one assumes that yaw rate and roll rate are so small that their cvcoducts 

and squares can be neglected, then the equations can be decoupled* 

j This chapter describes a si alation of our le^^pxing control system 

(LCS) on a digital computer as applied to the decoupled longitudinal and 
lateral dynamics of the F-8 DFEV/ aircraft. The simulation is not concerned 
with real-time while the aircraft is flying. In this study, as a first 
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attempt of applying the LCS system, fourth order linearized representa- 
tions were used both for the modeling of the longitudinal and lateral 
dynamics of the plant* The main concern was to get insight into the 
properties and the behavior of the LCS. 

Another aim of the work described in this chapter wa?s to compare 
two different learning control systems based upon tivo different Informa- 
tion Acquisition Subsystems (IAS): (1) Liapunov Method and (2) Newton- 

Raphson Method • 

V/e v/ill consider a more realistic application of our LCS in the 
following chapter (Chapter 5) where we use non- linearized equations for 
the plant and v/e are concerned with the real time application. 

This chapter is divided into five sections. The first two sec- 
tions v/ill describe the longitudinal dynamics and the LCS as applied to 
the longitudinal dynamics . The next two sections will describe the 
lateral dynamics and the LCS as applied to the lateral dynamics. The 
last section will discuss the simulation with the associated results and 


some concluding remarks 


4»2. The Longitudinal E)ynamlcs 


For the longitudinal case the state variables of the aircraft 
system are pitch rate q, forward velocity v, angle of attack *=< and pitch 
angle 6. The linearized equations of motion at selected flight conditions 
have the matrix form 


c< 

8 


'10 


0 

1 

1 


0 

-g 

0 

0 



- — 


- 


<3 


P7 


V 


Pa 


c< 


P9 

J 

^8 


.0 . 


j; 


(4.2.1) 


J". 


where 0 ^ is a linear cor^ination of the states and the pilot's input 


given by the following equation 


(4.2.1a) 


Note that 0 is the pilot's (stick) input and K and G are gain matrices 

S . " 


as defined in Chapter 3* Parameters through p^^^ are functions of the 

( 21 ) 

aircraft's stability derivatives given by the following equations : 


p. » (qSC/2V I ) (C„ +C ) 
1 o y n . 


PlO =. (qSC/V^Iy) 


- 

m 

*- u 


(qSC/2mV )C . 


o m. 




P2 - (57.3,SC/y - (qSC/2mV^C^^ 


P3 - - (qS/mV^)Cjj 


u 


p^ - (57.3qS/m)C!^ 
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Pg » (qS/mV^)Cj^ Pg » <57.3qS/mV^)C^^ 

p- » (57.3qSC/I ) fc - (qSC/2mV^)C C_ ^ 

7 ^ I "*/ ° 

6 


■8 


(57.3qS/m)C. 




Pg - (57.3qS/mV^)C:^ 




(4.2.2) 


The stability derivatives affect the dynamics of the aircraft and as an 
examole we v;ill discuss the effects of C 

^e< 

The static longitudinal stability of the aircraft is determined 
by the term C which is the change in the Ditching moment due to a 
change in the angle of attack. For a ’’statically stable aircraft” this 
term must be negative. A statically stable aircraft is one that tends to 
return to its equilibrium condition after a disturbance has occurred. A 
negative C means that as the angle of attack increases positively, the 
pitching moment becomes more negative tending to decrease the angle of 
attack. The opposite is true for a positive C 

™(X 

Although all aircraft are designed to be statically stable, C 

^o< 

negative, certain flight conditions can result in large changes in the 
longitudinal stability. A severe shift in the longitudinal stability 
results in some high performance aircraft at high angles of attack, a phe- 
nomenon referred to as ’’pitch -up”. As long as the slope of the versus 
<X| curve is negative the aircraft is stable but as the angle of attack 
is increased, the slope changes sign and the aircraft becomes tonstable. 

If corrective action is not taken, the angle of attack increases until the 
aircraft stalls. This usually happens so rapidly that the pilot is unable 
to control or stop the pitch-up. A practical solution of the pitch-up 


53 


problem is to limit the aircraft to angles of attack below the critical 
angle of attack; however, this also limits the performance of the aircraft. 
In order not to limit the aircraft’s perfotrmance, the longitudinal con- 
trol system is designed to make the aircraft flyable at angles of attack 
greater than the critical angle of attack- To design properly the longi- 
tudinal control system, we need a good estimate of the longitudinal param- 
eters which is provided by our Learning Algorithm Subsystem (LAS) • Our 
metliod of development of the longitudinal LCS is described in the following 


section 
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4.3 » Realization of the LCS as Applied to the Longitudinal Dynamics 

As indicated In Chapter 3, to implement Liapunov's IAS, a model 
described by Eq. (3.2.4) is used v/hich for the longitudinal dynamics 
has the form 


m 

e 

*1 


-10 

0 

0 

0 


^1 

^2 


0 

-10 

0 

0 



^3 

* 

0 

0 

-10 

0 

i 

^3 

• 

^4 


0 

0 

0 

-10 


®4 



- 







Pl+10 

0 

1 

1 


10 

3^ 


p.+lO 


p ^+10 


0 

-0 

0 

10 


q 

V 

eK 

e 


■ A 

P. 


•8 




(4.3.1) 


or in equivalent form 


A 

P 

A 

Ps 


10 

3 


A 


A 

A 


“9 

0 


1 

0 

0 

0 

Jl 


m 

-10 

0 

0 

0 



0 

-10 

0 

0 


®2 

0 

0 

-10 

0 

K 

®3 

0 

- , 

0 

0 

-10 


■ 

®4 

p* •• 


•A ^ 

P. 


fs 


(4.3.2) 


where p^ is an estimate of p^ and 
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are of thixd order because there are no unknown parameters in the fourth 


differential equation therefore, e^= 0. The matrices Q^, re- 
duce to be scalars because there is only one column in matrix B^(p) • The 
infinity elements in Ng and matrices indicate that the corresponding 
elements of the matrix is known* For example, the first diagonal 

element of matrix being oo shows that the first element of the third 

row of the A (p) matrix is knov/n, namely 1. After these choices have 
ra 

been made (Eqs. 4.3.4) the adaptive algorithm equations given by Eqs. 
(3.2.16) and (3.2.17) reduce to 

-fc 


10 


A 

Pi 


A 

P/ 


^i(o) 


^10 


l(o) 


(l{o) 


^;<o) 

P 4 


- ( 30e^q/ . 

® * 

- fc30e3^. 

f 

(30e^o^/. 

■ i 

■I 

■I 


01) dt 


v/10) dt 


0001) dt 


(30e2V/10) dt 


^(o) 


A(o) 


■I 

i 

■I 


(30e2®^ /.OOl) dt 


(lOe^v/lO) dt 


( 306^ / .0001) dt 


■b 

Pj - Py®^ - J (SOe^^iTg/ .00001) dt 


(4.3.5) 
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P3 " Ps 


(30e, tT /. 00001) dt 
2 s 


Pg - Pg 


- (30e^<f^/,0000V dt 


1 ( 0 ) .. 


where is an initial estimate of parameter pj^« 

For the Newton>Raphson IAS a model described by £q. (3.3*1) is 
used which for the longitudinal dynamics has the form 


+ A ^ Os 

X3 


(4.3.6) 


In this case the sensitivity matrix (Eq. 3.3.10) has the dimension 

3 X 10 with a typical element (the i-j element) Tix./^p. (i^ 3, j ^ 10). 

1 j 

Each element of the sensitivity matri:-: satisfies the differential equation 
(3.3.13) v;hich is used to evaluate that element. For example, the ele- 
ments snd "^x^/dp^ satisfy the follov/ing differential equa- 

tions respectively 


d/dt( ’9 X2/3 P 3 ) =« P^Ox^/SPj) •t' P^('^ + ^2 


(4.3.7) 


d/dt( 3 x^/a p^) » OXj^/^p^) +P 2 ( ^ X 2 /dPi) + Pq< x,/ S Pj^) 


Note that the other elements of the sensitivity matrix are not Shown here 
but are obtained in a similar way. After all sensitivity elements sure 
obtained , they are substituted into Eq. (3.3.12) to yield an improved 
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estijnate of vector p. 


In this study we used data from reference 


and considered the 


four wing-down (CO) configurations. The simulated flights were at three 
different altitudes (L a sea level, L a 20,000 ft., L a 40,000 ft.) and 
three different mach numbers <M a 0.5, M a 0.7, H = 0.9). The first 
step v;as to interpolate polynomials through the data in the least square 
sense and determine a functional representation of the model system pa- 

A 

raraeter p^ with respect to mach number M and altitude L for the purpose 
of establishing the model as a function of M and L. These functional 
representations ara used for the Learning Algorithm Subsystem (LAS) and 
were determined so that the mean square error betv/een the curves and the 
data was belov/ a pre-specified botind. The functional representations of 

A 

the p^ parameters are of the form 


?1 a p^(K,L) » ‘"4*^ ^ 

» P2(^^L) = Cg + CgM + C^L + CgM L + + C^^L^ 

?3 - P3(M,L) = C33 ♦ C33M * C33L + L + 

P 4 - P4<K.« = C,7 - L 


’17 "*18 19 " 20* 


Pj ■ PjCM.L) . C33 + CjgH + C33L + CjgH L + 

Pg . Pg(M,L) = Cjg + Cggfl + C33L + C33II L ♦ C33H ♦ CjgL 


p, . p^CIf.L) . C35 ♦ CggH 7 C3,L ♦ CjgH L ♦ 


Pg . $g<M,L) . Cgj_ * CgjM ♦ C^L ♦ C^gM 1. 7 Cggll^ ♦ CggL^ ♦ 




Pg " P9^*■^^J ■ ^0 ^ 52 ^ ■*“ ‘“53^'^ ^ * ** 55 ^ 
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PlO - • =56 * S?” * =58^ ♦ =59" ^ * =60'-'^ * =61.''^ * 

+ CgjL m 2 * CgjM 1 } * + CgjM* ♦ CggL 


After establishing tlie model as a function of M and L we applied our 
LCS as described in the previous chapter and the results are discussed 
in section 4.6# 

As an illustration of the LAS consider one parameter curve p^(M,L) 
and assume for simplicity that 


p^{lI,L) a 4 C^H + CjL » fl M L|* 

L 3 


= H(M,L)C 


(4.3.9) 


Now assume the aircraft flies at mach number and altitude and the 
IAS generates Pj^(M^,L^). ' Interpolating v;ind tunnel data we have an a priori 
yoiowledge of vector C; namely 


(4,3.10) 


Let P = I-j ( Identity matrix: and initial covariance matrix of vector C) . 
The LAS updates vector C according to 5q. (3.4.16) 


■ 1 ( 0 ) 


c 

♦ X ♦ + X,2)‘"|j^(M^,L,^). [l K^lJx Cj I 


(4.3.11) 
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where r^^^ is the first diagonal element of matrix R (covariance of \ 
given by Eq. 3.4.2). 

For simplicity let - 1. Equation (4.3.11) yields 


^1°^+ pi^^^l’V “ * ^1^ 

4°’v =3°’4>] 
[Pl^“l’V’^4°^-^ 4°^V 4°^h^] + Li> 


0^®^+ M 


(4.3.12) 


The updated covariance matrix is computed according to 
Eq. (3.4.14) 


1 

0 

0 


1 

0 

1 

0 

- 

”1 

0 

0 

1 




r X •! 

X (r^^j^ + 1 + + Lj^) X 


"'i h] 


(4.3.13) 


Equation (4.3.13) yields (for r^^^ » 1) 


1 + + L J 


- L, 


- M, 


- L, 


2+^1 - ”1^1 

- 2 * 


/(2 + + L^) 


(4.3.14) 


Next assume the aircraft flies at mach number M 2 and altitude 

A 

and the IAS generates (M2»^2^* Applying the second iteration of the 
learning algorithm (Eqs. 3.4.14 and 3.4.16) we obtain an updated vector C 
with elements 


: »•»•■***•«>•**■* ■>•■ -'‘••W i L mi n*ii irt f' i il* 1 1' "i 
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^ (1+mJ+L^-M^M2-L^L2)/D 

^2^^" C 2 ^^+ - P^(M2,L2)-(C^^^+C2^^M2+C3^^L2)J(2M2- Mj^+M 2 L^-Mj^Lj^L 2 )/D (4.3.15) 


and matrix P, 


2 1 


2 

‘2 + 

(H 3 - 1 -^)= * (I.J 

- Lj)2 ♦ 

(4.3.16) 

J? 

J,J^ J,J, 



1 

12 13 



'^l '^2 

jI J 3 J 3 

/ D 

(4.3.17) 

V3 

J 3 J 3 4 




. 1+ I-lJ L^ . ,.y.,^ - 


Jj . 2M2 - ^ . M^I.^1,2 


J3 - 2I-2 - 4 * L3H3 - 


(4.3.18) 


and is given by Eq. (4.3.14). 

The confidence criterion gives according to Eqs. (3.4.30) and 
(3.4.34) 

®1 “ /^l^^l’V “ 

'* ^ U 

02 - +[pj^(M2’V " 4^^"^ ”2 4^^ ^2^}^- 


(4.3.19) 


For the longitudinal LCS the elevator command structure is 
" ^ 1 *^ ■*" ■*■ ^ 3 ®^ ■*■ (4.3.20) 
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Imposing the constraint that the plant' s handling qualities remain the 
same over all possible operating conditions, Eqs. (3*5«1) and (4*3 *20) 
ar.e used to obtain relationships which constitute the control laws given 

by 

* Pio’'1'^Pio] ^'Pa ♦ P9 * Pi^ 

AlCj . -rpgCkjaPg+APjl+PjfkjAPj+Ap^j+Pj^gCkjAPjgtAPj)] /(p|49+Pio) 

(4.3.21) 

A kj - -rpg(kjAP3+Apj)+Pg(lCjAP9+AP5)+Piot''3^PlO+'^P7'J^*P3*P9'*’^lo’ 

Ag . - (PgGAPg * PjGAPg + P^(jGAp^(j)/(j| + Pg ♦ P^g) 


where the p^ are the learned parameters obtained from LAS and A p^ is 
the difference between the learned p^ at the current time and the previous 
time. Note that the p^ are computed using the Cj^s stored in the memory. 

In the next two sections we will discuss the lateral dynamics and 
the development of our LCS as applied to the lateral dynamics. 
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4.4. The Lateral Dynamics 

For the lateral case the state variables of the aircraft system 
are roll rate p, yaw rate r, side slip angle and roll angle . The 
linearized equations of motion at selected flight conditions have the 
matrix form 


'4> 


Pi 


cos a^ sm a^^ 


0 

0 

g/v^ 




^10 ^11 


^12 ^13 


'14 


«Ta 

/r 


(4.4.1) 


S' ^ is the aileron displacement and S' _ 

Angle is the trimmed angle of attack at each flight condition and the 


where d is the aileron displacement and J the rudder displacement. 

& jET 


parameters p^ through p^^ are functions of the aircraft’s stability 
derivatives and angle a^ and are given by the following equations 


I 2 * ’ ’ 2 

p- « L cos a_ - L sin a_cos a_ - N cos a_ sm a, + N sm a_ 
*^1 r f r f f p r “ ■ f r f 


• ' 2 ’ 2 * 
p- = L sin a^ cos a_ + L cos a_ - N sin a_ - N cos a_ sin a- 
^2 p f f r fp fr f t 


p^ = L/S cos a^ - Nyo, sin a^ 

♦ I 2 ’2 ' 

Pi = L cos a_ sin a . - L sin a_ + N cos a_ - N^sin a- cos a- 
*^4pf frfpfpf f 

• . 2 * ' * 2 
p_ =» L sin a^ + L cos a_ sin a _ + N sin a. cos a_ + cos a- 

5 p f r r f p r r r x 


Pg » L /5 sin a^ + Nji cos a^ 
p^ - Yp cos a^ - sin a^ 


(4.4.2) 


p_ a Y sin a- + Y^ cos a- 
p f r X 
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Ps ■ V 


f V 

« Lj' cos - Mj- sin 


I I 

^ J* ^f 

^ IT 


t r 

p^2 ■ '*' 

a a 


^13 “ ^f ■*■ ^J* ®f 


p, , » :f 


14 “ V 

■* <1 


v/here 


h - [^i ♦ [i - <wv=>] 

“I ■ [“i * <wy ^i] ' [^ - <4^w] 


and i » p, r, ^ 


The tinoriraed derivatives are defined as 


L a (qSh‘^/2V^I )C, 
o o ^ 


L a qSb /2V I )Ct 
r 0 x 1 . 


a (57.3qSb/I )C, , 

V ^ "*’yj 


Ly » (57,3qSb/I >C-, 


a (57.3qSb/r)C^ , 

Jr ^ V_ 


N a (qSb /2V I )C 
p ^ o s n 




Nj 5 • (S7.3qSb/yc 


. (57.3qSb/I.)C , 

4 . 


Mj- . (57.3qSb/I )C„ 
r J K 


Yp a (qSb/2nV‘)Cy , 

P 


Y a (qSb/2mV‘)C 
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- (57.3qSb/mV^)i 
Zf » (57.3qSb/mV 

As an example, v;e will discuss the stability derivative and 

how it affects the dynamics at the aircraft. is the change in the 

P 

rolling moment due to a rolling velocity, arising from the change in the 
angle of attack on the wings caused by a rolling velocity. The down-going 
wing experiences an increase in angle of attack while the up-going wing 
is subjected to a decrease in angle of attack. These changes in angle 
of attack cause changes in the lift and drag of the up and down going 
wings which, in turn, produces a moment opposing the rolling velocity. 

Analytical, results shov; that' tlie rudder input ( <S' ^ excites mainly 
the side slip angle y3 and yaw rate r. This is called the Dutch roll 
mode. In the same manner aileron displacement mainly excites the 

a 

roll angle ^ and roll rate p. This is called the Rolling mode. 

There exists a coupling betii/een the Dutch roll mode and the Rolling 

mode mainly due to the existence of C (v/hich is the change in the yawing 

n 

P 

moment due to a rolling velocity) and (which is the change in the rol- 

ling moment due to a yawing velocity) . 

The cause for C is the same as that of . The change in the 
^P P 

angle of attack as mentioned previously, will result in the lift vector 
being tilted fonvard on the dovm-going wing and rean-/ard on the up-going 
wing therefore producing a negative yawing moment (an adverse yaw) for 
a positive roll rate. 

arises from the changes in the lift on the wings resulting from 
r 


a yawing velocity. If the aircraft is subject to a yawing velocity, the 







(57.3qSb/mV )C 




)C. 




relative velocity of the left and right wing panels changes with respect 
to the air mass. The forward going wing experiences an increase in lift 
while the lift on the rearward going wing decreases. This factor caused 
a positive rolling moment for a positive yawing velocity. 

One of the functions of the lateral control system is to provide 
artificial damping of the Dutch roll. To do so, we need to have the lateral 
parameters as a function of mach monber M and altitude L, since the tran- 
sient response of the aircraft varies considerably with changes in airspeed 
and altitude. The lateral parameters are provided by our lateral LAS 
v;hose development in conjunction with the IAS is presented in the following 


section 
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4.5. Realization of the LCS as Applied to the Lateral Dynamics 

The model used to implement Liapunov's IAS for the lateral dynamics 
of the P-8 DFBW aircraft has the form 


Pl+10 

A 

P 4 

A 

P? 

cos a^ 


.10 0 

0 -10 

0 0 

0 0 


A 

P 2 

P5+IO 

A 

Ps 

sin a. 


0 

0 

-10 

0 


0 

0 

0 

-10 


0 

0 


Pg+10 g/v^ 
0 10 


10 Pll 


^12 ^13 
0 Pl4 


/a 

<Tr 


(4.5.1) 


or in equivalent form 


a 


A. 

A 

A 

” 





Pi 

P 2 

P3 

0 


p 

» 


A 

A 

A 




^2 


P4 

P5. 

Pe 

0 

X 

r 

^3 


A 

P 7 

A 

Pa 

A 

P 9 

g/v^ 



1 

i 


COS a^ 

sin 

0 

0 


0 



A 

PlO 

A 

Pll 




+ 

A 

P 12 

A 

Pl3 

X 


+ 


0 

A 

?14 


fr 



0 

0 


» — 

' 


-10 

0 

0 

0 


0 

-10 

0 

0 


0 

0 

-10 

0 


0 

0 

0 

-10 


(4.5.2) 


vd^ere p^ is an estimate of and 
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(4.S.3) 


The matrices P, and involved in the Liapunov function are chosen 
as follows: 

fao 0 0 1 


P a 


0 30 0 

0 0 30 



0.01 0 0 

0 0.0001 0 
0 0 0.00001 



0.001 0 0 
0 0.00001 0 
0 0 0.000001 


(4.5.4) 




0.01 0 0 

0 0.0001 0 
0 0 0.00001 

0.00001 0 
0 0.00001 

" co 0 

0 0.00001 
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The same reasoning as before (see longitudinal dynamics) Indicates that 

the P and N. matrices are of third order. Note that e. =0, Q, matrices 
1 4 i 

are of second order and matrix B^(p) has two columns. The infinity 
element of matrix indicates that the first element of the third row 
of matrix B^(p) is Icnown, namely 0. The adaptive algorithm equation used 
to identify the parameters of the lateral dynamics are obtained by substi- 
tuting Bqs. (4.5.4) into Eqs. (3.2.16) and (3.2.17) and are given by 


V 

Pi = Pi°^- J (30eiP/0.01)dt 

0 

?2 » P2°^- J (30eir/0.0001)dt 

o 

t 

J ( /O . 00001 ) dt 


?3 -P 3 


A J(30e2P/0.001)dt 


P 4 -P 4 


Pg » Pj®^- J (30e2r/0. 00001) dt 
o 
t 

Pg - Pg°^-J (3062^ /0 .000001) dt 
o 

p^ » Py°^- j OOe^p/O.ODdt 


A a(o)_ j (SOe^r/O.OOODdt 


P3 “ Ps 


A(0) 


./ 0 . 00001 ) dt 


0 

■fe 

PlO “ PlO^' / C 3061^^/0 .00001) dt 


(4.5.5) 


a(o) 
11 “ Pll 


- J (306i X^/ 0 . 00001 ) dt 




Pl2 - - J (3062 J^/0. 00001) dt 

o 

t 

Pl3 - p^j^- J* (3062/^/0. 0000 l)dt 

o 

■k 

^14 ’^14^"/ (30e2«//0.00001)dt 


A (o) 

Again p^ is an initial estimate of parameter p^^. 

In order to implement the Newton-Raphson IAS for the lateral 
dynamics j a model of the follov/ing form is used 




’a 

A 

A 



•• 


"A 

A ' 


. — 

^1 


Pi 

P 2 

P 3 
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‘^1 


PlO 

Pll 
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A 
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P 4 

P 5 
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^2 

+ 

P 12 

Pl3 


• 



A 

A 

g/v^ 





A 



^3 


P 7 

Pa 

P 9 


^3 


0 

Pl4 


/r 
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cos a_ 

sin a_ 

0 

0 




0 

0 
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f 

f 




4 

• «■ 

, 






(4.5.6) 


In this case, the sensitivity matrix has dimension 3 x 14 v;hose i - j 
element is 3 Pj (1—3, j ^ 14) . The differential Eq. (3.3.13) is 

solved to evaluate the elements of the sensitivity matrix. Once the 
sensitivity elements are obtained, they are substituted into Eq. (3.3.12) 

A 

to yield an improved estimate of vector p. 

For the lateral simulation, v/e once again used data from refer- 


ence (21) and considered the four wing -down (CX» confipirations at three 
different altitudes (L sea level, L * 20,000 it.., L » 40,0®® ft.) and 
three different mach numbers (M = 0.5, M = Q.7, M * 0.9) . The sense 
procedure used for the longitudinal case was followed through the lateral 
case except that the interpolation poiffnomials used- to represent the sys- 
tem parameters p^^ with respect to mach number M and altitude L were only 

ORIGINAL PAGTO IS 
OF POOR QUALITY! 
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expanded up to second order terms resulting in the following equations: 

^1 “ ■ ^1 ^ 2 ” * ^3^ * ^4^’^ * 

?2 - PgCKfU - C7 + CgM + CgL + C^qML + ] 

Pg » P2(M,L) a + ^14^^ + ^15^ + ^16^ ■*“ ^17^ ^18^ 

P4 - - =19 ♦ = 20 « + + =22“ * =23”^ * =2/ 

P5 • " =25 =25'’ * =27^ ♦ =2S’* ♦ =29”^ ♦ =30^^ . 

% = Pg"-'.!-) - * C32II * C33L H. C34IIL + C3gL^ 

% . p,(M,L) - C37 + C3gM ♦ C33L + C^m, + + c^l? (4.5.7, 

Pa * Ps^'’’“ ’ =43 ■" =44“ =45^ * =45^"^ * =4t'-’^ * = 48 ^^ 

P 9 - “ =49 ♦ =50'-‘ ♦ =51^' ♦ =52'-®' * =53''’^ * =54^^ 

P30 =■ P3q(M,L) a C33 + CggM + C33L + CggHL + Cggl-J + Cggl. 

hi = Pu»'-^> = =61 ♦ =62" ♦ =63*^ * =64'-^ * =65^’^ * =66^^ 

P 12 ' Pl2«'“ - =67 * =6S" * =69^ * = 70 "^ * =7l"^ * =72^^ ; ! 


^13 “ 
Pl4 = 


a C—a, 

+ 

+ C.f-Ii + 

+ C__M^ 

+ C__L^ 
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* =84^^ 


Once the relationship between the lateral parameters as a function of 
mach number M eind altitude L has been established Ec[s, (4.5.7) the 
application of the LAS is implemented by the iterative application of 
£qs. (3.4.14) and (3.4.16). 


For the lateral LCS the aileron and rudder command structures are 
vT_ - K,p + o S 

* ^ t si (4.5.8) 

J"r -Kjr * °2'’'s2 . 
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Imposing the constraint that the plant's handling qualities remain the 
same over all possible operating conditions, Ekjs. (3.5»l)and (4.5.8) are 
used to obtain relationships which constitute the control laws given by 

A Ki . - [S^^(K,A P,o ^ A Pj) * p^CK^A p^ * A p^)] 


A K2 ■ -^p^^(K2Ap^^ + A P2> + P^3 ^ P5^ + 

A A \1 //'' 2 A 2 A 2. 

+ Pi 4 <K 2 APi 4 + APg) + p ^3 + ) 


A 


K. 


3 ■ 




* * Pl3'‘'3‘^Pl3 * 


A A //'' 2 A 2 A 2. 

+ Pl4tI<3APi4 + A P9)|/(Pii + Pi3 + P34) 


(4„S.9) 


A = 


^PlO^l^PlQ ^12^1^ ^12^ ^10 ^ ^12^ 


A 


^^ll®2^^11 * ^13^2^ ^13 * Pl4'^2^Pl4^^^^11 ^13 * ^14^ 


where p^ are the learned parameters obtained from the LAS and A p^ is the 

A 

difference between the learned p^ at the current time and the previous 

A 

time. Mote that p^ are computed using the C’j^s stored in tl-je memory. 

This process v;as illustrated in section 4.3 and is carried out for the 
lateral case in the same fashion. 
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4.6 « Results and Discussion 

Our LCS was applied to the longitudinal and lateral dynamics of 
the P-8 DPBV/ aircraft. The complete system was simulated on a digital 
computer as shown tjy the flow diagrams of figures 4-la (Liapunov's IAS) 
and 4-lb (Nev/ton-Raphson IAS) . The learning of twenty four aircraft 
parameters (ten for the longitudinal dynamics and fourteen for the lat- 
eral dynamics) have been evaluated. Six of these parameters (longitudi- 
nal p^, p^, pg and lateral p^, p^, p^^) are presented in Figs. 4.2-4.13 
to emphasize the learning performance of the two different LCS. 

A comparison betv/een Pigs. 4-2 & 4-5 and 4-4 & 4-7 shov/ that the 
learned parameter curves using Ne'rton-Raphson IAS give a better approxi- 
mation to the plant parameter curves than the ones obtained by using 
Liapunov's IAS. A comparison between Figs. 4-3 & 4-6 show that where 
Liapunov's IAS failed to obtain a better approximation of the initial 
parameters, Newton-Raphson IAS did not. These comparisons are based upon 
equal time operation of the two learning control systems, while both the 
information acquisition subsystems passed the convergence criterion . 

Since the convergence criterion has been satisfied in both IAS’s there 
is no need to further extend the monitoring of the adjusted parameters. 

It is assumed that once the parameters pass the convergence criterion 
little or not additional information is gained by fur'ther operation of 
the system. 

These observations indicate that the performance of the learning 
control system is strongly affected by the accuracy of the IAS. The com- 
parative results show that the LCS using the IAS based on Newton-Raphson 
method produces better learning of the parameters. This, in turn, means 
that the gain schedule adjustments were more accurate than the adjustments 


OF pCOtl 
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obtained from the LCS based on Liapunov’s direct method IAS. These ex- 
perimental results and the comparative observations were described and 
presented at the NASA session of the 1976 IEEE Decision and Control 
Confer^ce, Clearv/ater, Florida, December 1976. The presentation is 
published in the Proceedings of this Conference' . 

It was also observed experimentally that for a low amplitude input 

signal S in both Liapunov* s and Nex'/ton-Raphson* s IAS the measurement 
s 

information content, y, is not adequate to obtain a useful estimate vector 
p because the process attempts to fit the noise. This indicates that there 
exists a need to make tests on neasxirement information prior to modifying 
the estimate of parameter vector p. For the Liapunov IAS this test was 
incorporated in the convergence criterion described in Chapter 3. For 
the Nev/ton-Raphson* s IAS lov;er bounds v;ere placed on the diagonal elements 
of the information matrix I, given, by 

V 

I x/^p)^ x/d P). (4.6«1) 

til 

_ A ^ 

prior to allowing adjustments of the estimate or parameter vector p. The 
bounds v/ere determined by examining the variation of the appropriate in- 
formation matrix elements during steady-state conditions with no input 
signal applied. In that case, the variations were caused by measurement 
noise so that any variations of the elements below the selected bounds 
were regarded as being caused by measurement noise and therefore, the 
estimate of parameter vector p vras not adjusted. 

Since Newton-Raphson method proved to give better results for the 
IAS therefore, this information acquisition procedure is incorporated 
vM-ith the LAS and the Memory and Control Process subsystem (MCPS) to form 
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a LCS which is applied to NASA' s real-time simulation of the F-8 DPB’/ 
aircraft* This LCS has been evaluated on NASA's real-time computer and 
is presented in the next chapter. 
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Pig# 4# la Logic Plow Diagram. (IAS - Liapunov Method) 
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Fig. 4.1b Logic Plow Diagram (IAS - Ne\rton-Raphson Method) 
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Pig. 4.6 LONG. P3 VS. NflCH (Nev/ton-Raphson IAS) 
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Fig. 4.10 LRTR . Pll VS . MflCH (Liapiuiov ZAS) 
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rig. 4.13 LflTR. Pll VS. MflCH (tlewton-Raphson IAS} 
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CHAPTER 5 

CASE STUDY U; THE REAL-TZ^^ APPLZCATXOH OF THE LCS 
TO THE P-8 OPBW AIRCRAFT* 

S.I« Introduction 

As indicated In the previous ciiapter, the Learning Control System 
(LCS) using the Uewton-Rai^son IAS had better performance than the LCS 
lising Liapunov IAS. Cur main concern in this chapter is to apply the 
I^S using f!ewton-Raphson IAS to the piloted six degree of freedom sirula- 
tion of the ?-a DFBV/ aircraft. This was performed at NASA, LBCf on the 
real-time simlator. 

Liapunov's IAS appears to have a converc^ence problem when applied 
in real-time. This ^ in contrast to the fact that Liapunov's IAS is by 
Construction stable, i.e. the convergence of the IAS and hence of the 
LCS is alv^ays guaranteed. The reason for this discrepancy is that the 
integration routine involved In the adaptive loop requires integration 
sfepsize as small as a hundredth of the smallest time constant of the 
aircraft dynamics. Since with such a small stepsize« the approximation 
of Liapunov's function derivative is no longer semi-definite therefore, 
one becomes confronted with a covergence problem. 

The Learning Control System (LCS) designed using Newton-Raphson 
IAS has been modified for application for real-time control of the P-8 
DFSifl aircraft. A problem of the N«4ton-Raphson IAS is the requirement 
to to determine the sensitivities of the system state to unJenovm param- 
eters, which is the most time consuming part in the XAS. For this reason, 
research has been done on the computation of state Sehsitivitias using 

* This study has been conducted at NASA^ Langley Research Center. 
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( 12 ** 0 ) 

coduced order nodels* *' In our appllcationf descrilsed In this chap- 
ter! we us<! first or second order models which generate a modeling error 
noise* This indicates the need to malce tests on Information prior to 
taodifying parameter estimates. These tests and convergence tests are in 
corporated in the convergence criterion* 

In the next two sections we will describe the real-time application 
of our LCS to the longitudinal and lateral dynamics of the P-8 OPBi/ air- 
craft! respectively and in the final section! we will discuss our results* 

5*2* The Heal-Tlme Longitudinal LCS 

For the longitudinal LCS the dynamics of the plant were simulated 
according to the equatiois 


V ■ - q S/(Cjj - cos«<)/n - g sinV 

• 

y • q S(C^ ♦ since )/iav - g cosy/v 

q. q Sc Cj/Iy (5*2*1) 

O a q 

e 

L a V siny* 

whjere v is the air speed « 1 ( is the flight path angle, q is the pitch 
rate^ 9 is the pitch angle, L is the altitude and oc a e . yr is the 


angle of attack* 

The model used for the longitudinal IAS has the form 


^ - q ♦ 9 ^ 0 ^ 


(5.2*2) 


q a ♦ Pjq ♦ P 3 d^ 

The model given by Eq* (5*2*2) describes the short period longitudinal 


node* The longitudinal phugoid mode was not modeled bed^use its time 
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constant is nomally large etnd its identification is not possible with 
a short tern span data base. Also, its definition is not a tnajor element 
in adjusting the feedback gains of the primary control loops. Therefore, 
our model given by Eq. (5.2.2) is only of second order in view of the 
plant being of fifth order. 

For the real-time computation of Sq. (5.2.2) we utilised a dlscrete- 
tirje approximation* The discrete-time approximation is based on the divi- 
sion of the time axis into time intervals of T => 0.125 seconds each. Since 
the time increment T is sufficiently small compared with the time constants 
of the system, the response evaluated by discrete-time methods will be 
reasonably accurate 

As it is shown in Appendix 3, the difference equations that approx- 
imate a linear system of the form 

X ■ Ax ♦ BiT (5.2.3) 

s 


are given by 


x(k+l) « (TA + I)x(k) ♦ Tb/ 


(k) 


(5.2.4) 


For our nodel described by Eq. (5.2.2) we have 


?1 




and 


(5.2 .5) 


B • 


Therefore the real-time computation of Eq. (5.2.2) is evaluated using the 
discrete-time system 


r.i 


Ik^l 


1 ♦ Tp^ T 

IPj l.TPj 


L^3J 


'fs 


(S.2.6) 
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The fit error for the Neivton-Raphson IAS is defined as 
xf 

j(p) - H [ 2 ]^^ ^11 * ^ 22 ] (S.2.7) 

k3t 


where oi and q are the sensor rReasiurements of the angle of attack and 
pitch rate respectively, and are the elements of isatrix B« Note that 
the number 25 (upper limit of sunxtation) is the size of the data base over 
v*ich the Nev^ton-Raphson IAS iterates. 

To develop the Newton-Raphson IAS, let us compute the gradient of 
the fit error with respect to p. This yields 


(^ J/3p) - 


where 




®1^I1^ /b ?i)jc ♦ «2^22^^ Pl^k 

^2>>: * P2’)c 

®1^11^ /b P3)j, ♦ e2b220q/dP3>^ 

eibiiOoc/^ P4^k * *2^22^^**^^ P4^k 
o< ^ o< 
q - q 


(5.2 .8) 


(5.2.9) 


To obtain an approximation of the second partial derivative of the fit 

error with respect to the peurameters , let us differentiate Sq . (5.2.8) 

vfith respect to p ^d neglect the second order partials of the states 

9 9 

with respect to p. This yields the matrix o J/d(V whose elements are 
given by the following equation 

X5 

Ksl 

Zf 

(b^j/dp^)32 • (^^J/&p^)2i * ^ '‘*11^ /a * 

k*t 

♦ b220 q/ b pj) jc< ^ q/ ^ ic 
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xf 




( a^J/3p^),4 - ( ^^J/^P^)43 - y Pl>jc< ♦ 


kii 

* 1»22^^ ^ Pl^lc^ ^ q/ 3 P3> Jg 

14 - < - X 

k=i 

♦ ^22^ ^ ^ jg( ^ q/ 9 P4) ij 

{‘a^J/ap^)22 *2 ^11^ 


k»i 


zr 


0*J/dP^23 • <a*J/3p^)32 - 2] >’j30«/»P2)kO*/SP3>le 

k=i 


♦ ^ 22 ^^ ^ ^ q/ ^ ?3> jj 


(5.2 


zr 


(3^J/35*)24 - (^'J/3P")« - * 

x«l 

♦ b22(^q/t>p2)jc<^q/^P4^jc 


( B‘^J/ 3p^) 33 
<a*j/3p*)34 

(3*j/ap*)44 


A 

• 2 ^11^ ^ P3^ k ^22^ ^ k 

ksl 

xs 

. O^J/8p*) 43 .y b^^{3<x /dP3>,t( '9«/aP4)fc ♦ 

♦ bjgC q/ S P3 J ^ ^ P4^ k 

Zi” 

- I ‘>ll‘»“/^^4>^ ♦ b220q/3P4>J 

k*t 


. 10 ) 
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Tel cORpute the sensitivities Involved in Sqs* (5.2.8) and (5.2.10) let 
us differentiate Sq. (5.2.6) with respect to p. This yields 

( d^/d ^ <i ♦ Tp^)(3*^ ■*“ 

( 3 ■ <1 ♦ P2Jjg ♦ "TO ?2^k 

( ■ (1 >► T?^)(^5< /3 ^2)3^ ♦ T(3 q/3 Pjlj^ (5.2.11) 

(3»</3p^)j^^j^ - (1 ♦ Tp^)0^/5pJj^ •► TOq/5p^)j. >Tot 

O q/^Pl).^^l * T?2)Oq/a p{iy^ ♦ 

(Zq/d ?2J]C+1 • ^ Pa^jj ♦ ♦ ‘T?2><^q/^ P2^jc ■*’ ^ 

(■^q/3 P3)j^^2 ■ /^?353^ ♦ (1 •► Pa^e * ^*s 

O q/^P4*\j^X “ ^1^ ^ ■*■ ^ 

The difference Sq. set (5.2.11) is solved startinq with zero initial 
conditions, since the Initial state is independent of the initial choice 
of the parameters. 

After the first and second partlals of the fit error are com- 
puted with respect to the parameters p^, the p ara me ters eure adjusted 
according to the following equation 

%♦! • Pj • J/3p)^ 


(5.2.12) 
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IXiring the iteration process defined by Bq. (5.2.12) the information 
content of the data base is checked by examining the determinant of 
the matrix ( B^J/3p^) prior to making iterations on the vector p, 
especially, if the determinant is lower than 10* . !^iis number was 
determined by examining the variation of the determinant of the matrix 
P^) on the real-time conputer during steady-state operation 
of the system with no input signal applied. In this case, the varia- 
tion of the determinant is caused by measurement noise so that if the 

-7 

determinant is lower than 10 we considered that there was no adequate 
information in the processed data-base therefore, the parameter vector 
^ was not adjusted. Another test incorporated in the convergence cri- 
terion did not allow more than 8 iterations over the data-base. We 

I 

disc checked that the fit error should be lower than 0.1 and the Euclid- 
ean norm of the gradient vector (^J/dp) be lower than 5 x 10*^. 
Finally, the four diagonal elements of the matrix ( 3 j/d p ) were 
checked to be lower than 1,000, 300, 300 and 300 respectively. The 
numbers 1,000, 300, 300 and 300 denote the maximum allowable covariance 
of the parameters p^^, p^, and p^ respectively and v/ere chosen ex- 

perimentally using the real-time computer at NASA. When these tests 
were satisfied, the parameter vector p was passed to the Learning 
Algoclthn Subsystem (LAS) • 

For the longitudinal application of the LAS, the relationship 
between the parameters as a function of mach number H and alti- 
tude L uas represented by second order polynomials. This relationship 
is given by 

* Thd reason for selecting only 8 iterations is that we wished to check 
the convergence during one second due to time limitation by the real- 
time sisulator. 
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pj^(M.L) - ♦ CjM ♦ Cjt ♦ ♦ CgL^ 

p^CK.L) - C^ ♦ CgM ♦ CgL ♦ C^HL ♦ (5.2.13) 

Pjta.L) . <^3 > c^^a ♦ c^L ♦ Cj^i. ♦ ♦ c^t* 

p^<a,L) . Cj5 ♦ CjijK ♦ Cj^L •► CjjKL ♦ Cjy-^ » C^^t? 

The LAS then produces the coefficient vector C by the Iterative applica- 
tion of Sqs. (3.4.14) and (3.4.16). The obtained C, 's are then passed 
to the !!eniosy and. Q»ntrol Process Subsystea (i£PS} . 

The design criterion for the longitudinal IX? S irfas to maintain 
a short period danping coefficient of 0.7 of critical damping indepen- 
dent of frequency. TO neet this criteriont the elevator control was 
chosen in the follo^v'ing fona 

Y - f ♦ K q + K,/- (5.2.14) 

e e^ A <.9 

The first tern on the right side of Eq. (5.2.14) is a trie integrator 
’.irhich allows the pilot to bias the total elevator position vising a three- 
position switcn (pitch up, off, pitch doi^n) . Figure (5.9) indicates 
the variation of the short-period root as a function of the feedback 
gain K^. It also indicates the ui^er linit inposed on the gain. The 
limi t is necessary to avoid instabilities resulting frov rata limiting 
of surface actuators. Substituting fiq. (5.2. 14) into Eq. (5.2*2) and 
rearranging terms yields 
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o<. - q ♦ p^cfC 

q • Pj^of ♦ (pj ♦ * *2 


(S.2.1S) 


The characteristic aquation of tha system daaccihed by Eq. (S.2.1S) Is 
obtainad as 


* - Pi 


A 

- Pi 


- 1 


* - (P2 ♦ P3K1) 


• 0 


(5.2. 16) 


or 


s - (pj, ♦ p^ ♦ - Px ♦ P4 <P2 P3V * ® 


(5.2.17) 


therafoca 

2j* - - (?2 ♦ ♦ Ps^X^^'^ ^1 ^4 ^^2 * ^3*^1^ (5.2.18) 

Solving Sq. (5.2.18) for and substituting the requirement 
^ « 0.7 yields 

Kj - ( - ?2*y - 2p^ - p^)/p^ (5.2.19) 

A 

Equation (5.2.19) cosprises the control law and the parameters Px'^ 
substituted using 2q. (5.2.13). The M and L used in Sq. (5.2. 13) 

are the current contents of the memory for which the ga. n scheduling is 
computed. The results obtainad for the longitudinal case are discussed 
in section 5.4. 


5.3. Ihe Real-Time Lateral LCS 

?or the lateral LCS the dynamics of the plant were used utilising 
the existing NASA's real-time simulation of the six -degree-of -freedom 
dynamics of the P-8 0F£M aircraft. The simulator is described In refer- 
ence (92) • the six-dagree-of -freedom nonlinear equations can also be 
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found In several textbooks (7,17 ). The slmalatlon was inechanlzed using 
a COC 66C0 conputer operating In real-time. The simulation code was ar- 
ranged to be consistent with the P-a DFEW capability in that data samples 
were taken at 0.125 second Intervals. 

The perfonaar.ee of the plant during sijiulated flight over a wide 
range of conditions is given In reference ( 61) . It represents a simu- 
lated flight to exasilne the lateral system characteristics. In the run, 
zero-mean Gaussian noise was added to the sensor measurements. The noise 
levels were 0.67°/second for rate measurements and 0.67® for angular 
measurements. The control law was not engaged at time intervals during 
the run to determine the characteristics of the xonaugmented vehicle. Noise 
was propagated to the aircraft surface commands only during times when the 
control laws were engaged. 

To develop a LCS for the lateral dynamics, we first need to consider 
a model that characterizes the predominant lateral modes. Note that for 
implementing tne LCS on a flight computer, the primary control loop pro- 
cessing is assigned high priority ;vhile the IAS and LAS are processed in 
a time avetilable basis. This Indicates that the IAS and LAS apply to data 
taken in the past during the same flight. 

The model used for the lateral IAS has the fora 

r - 5^r + $2 0 + ?7 J'p <5.3.1) 

- r . 

\4here p is the roll rate, r is the yaw rate, J5 is the angle of sideslip 
and p^ are the parameter to be identified and processed by the learning 
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m 






ir- 




algorlthn. The terms a and b are constants which depend on the moment 
arm of the vertical stabilizer* These constants were included in order 
^ reduce the number of parameters that must be identified. Note that 
^ is the rudder actuator and lT^ Is the aileron actuator. 

The model assumed by Eq. (5.3.1) describes the lateral roll mode 
and the lateral Dutch roll mode. The lateral spiral mode was not modeled 
because its time constant is normally large and Its identification is not 
possible v;ith a short time span data base. Additionally, its consider- 
ation is not a major element in adjusting the feedback gains of the pri- 
mary control loops. 

For the real-time computation of Eq. (5.3.1) we again utilized 
a discrete- time approximation. The discrete-time approximation is based 
on the division of the time axis into time intervals of T ai 0.125 sec. 
each. The response evaluated by discrete- time methods will be reasonably 
accurate since the time increment, T, is su.xrciently small compared with 
the time constants of the system. 

As shown in Appendix B, the difference equation representation of 
the model is obtained, as before, by Euler's method. (See 5q. 5.2*4). 

For our model described by Eq. (5.3.1) we have 



(5.3.2) 
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Therefore, the real tine computation of Eq. (5«3«2) is evaluated using 
the discrete-tine systen 


r 

U* Jki>l 


1 ♦ 


1 ♦ Tp, 




Tp. 


L 

0 

-T 



+ 


Tbp7 


- 


0 

Tpi^ 


j; 


0 

Tap7 


1 


1 + Tp, 


5J 


r 1 
P 


u- J 


(5.3.3) 


(5.3.4) 


For the lateral dynamics to implement the Newton-Raphson IAS, we used 
two separate performance indices. This was done to reduce the onboard 
computations. The cost fmction for the roll mode was 

i.S' 

Jj^fp) • H ^ (P - P)^ 

kzl 

A ^ A 

and is minimized over the p^, P 2 and Pg variables. Note that p denotes 
the sensor measurements of the roll rate. For the Dutch roll mode the 
cost fvuiction x>ras 

J2(p) » h + y (5.3.5) 


'k 22 


fcsl 


ksi 
AAA 


and was minimized over the p^, p^, p^ and p^ variables. Note that yS 
and r are the sensor measurements of the sideslip angle and the yaw rate 
respectively. 

The partitioning results in the requirement of invertixig one 
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3x3 matrix and one 4x4 matrix as opposed to one 7x7 matrix In the 
Newton- Raph son algorithm* To apply the lateral IAS we need to compute 
^ p» 1 ^J 2 /dPi is done by differen* 

tiatlng Eqs* (5.3.4) and (5.3.S) with respect to p which yields 


if <p - p)u op/api>k 

3jj/ap-.\ (p - p)|j op/Bpj)^ 

.1 

<P - P)^ Op/^Pg)j, 






* =2‘=22‘^‘"^^P3>fc 

* ®2^22‘^‘^''®P4>IC 
®Al‘ V''3Ps>k ♦ ®2'’22<®-''3P5>fc 
*X‘=I1< V P7>k * V22*’’ 


v/here 


®x - / -Z’ 


«2 - ' - 


2 2 2 2 

To obtain '9 Jj^/Sp and '3 ^ 2 '^^^ differentiate Sqs. 

(5*3.6) and (5.3.7), v/ith respect to p and neglect the second order par- 
tials of the states with respect to p. This yields the follov/ing equa* 
tions for the elements of the above matrices 




r3*J/3p*)l2 - O^V^P^’2X *X 
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zf 

X>l 

ir 

O^Jj^/ 3P^)22 -^(^p/dPj).^ (5.3.9) 

h:i 

tr 

(Z^J^/dp^)22 - -^^^P^9P2^k<^P/^Vk 

fc** 

zr 

(‘3^Ji/ap^>33 -^(^P/aPg)^ 

ksi 

and 

O^Jj/ap^)!! * ‘= 22 ‘^'^^P 3 >k 

K*l 

XS" 

. (a%/ 3 p^) 2 i + 

< 5 »f 

■I. b22<^ P3)j^(5i^/^P4^3C 

XT 

(a^jj/ap^jjj - (a%/3p^)3j ‘>ii<¥'^p3’ic‘¥''®p5’ic * 

KSI 

+ b 22 <^ r/ 5 Pj) 3 r/ ^ Pg) 

xf 

<a%/ap^)3^ . (3 %/®P^>41 "2! P3>1c'¥/3P7)ic ♦ 

Xs| 

+ b22(^r/Bp3>,^Or/ap7),^ (5.3.10) 

(■^\/^P^)22 -2^11<^/''^P4^k * 

Kai 

0 ^J 2 /^P ^>23 - 

k=/ 

+ b22(^r/aP4)J^(^r/^P5)^ 
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ks| 

♦ bjj o r/ S P4) jjO r/ ^ P7) Jg 

ir 

* P 22 <^ 

icai 

i.r 

o%/ 3$^3^ . o^a"p2),3 .^b3,o//aJ5)^oy3/dS3)^ ♦ 

ka| 

♦ b 22 ( ^ r/ d Pj ) J.C ^ r/ d p^) 

Lf 

k*i 

To compute the sensitivities involved in Bqs. (5*3 . 6 ) and (5.3.9) let us 
differentiate the first equation of the Eq. set (5.3.3) with respect to 

^ A A 

Pj^, p 2 and pg. This yields 

Op/S » (1 + Tpj^)0 P/^ Pi>]^ + ^ 

( S p/ 3 ?2^ ■ (1 + Tpj^) O p/ ^ P 2 ) (5.3.11) 

Vk+1 “ + "^Pl^ ^ ^ P/^ Ps^k ■** ^'^a 

In the same fashion we differentiate the last tv» equations of the Bq. 
set ( 5 . 3 . 3 ) with respect to p^, P 4 , P 5 and p^ to compute the sensitiv- 
ities involved in Eqs. (5.3.7) and (5.3.10). This yields 

(Sr/dS 3 )j^^^ " * Tp^)(Sr/3p3)j^ + T? 30 //S $ 3 ),^ ♦ Ty3 

(Sr/ap4 )fc^i - (1 + Tp^)(^ ^ 
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F 


- u ♦ Tj^)Or/aP5>)c ♦ 

Or/a - <1 ♦ •:p7H^r/3P7)^ ♦ T^jO/S/ap,!^ ♦ »-^r 

0(»/aP3>k*i ■ - TOr/aSj)^ ♦ (1 ♦ TPs' t^/'/^Ps'k 
), , > - T(ar/ap^)jj ♦ u ♦ Tpj) iip/3 p^ijj 

- TO r/a P5>^ ♦ <1 ♦ TP5) Oyi/ 3 P5>fc ♦ T/» 


(5.3.12) 


(3/5/3 P4-k^.l 


(3/5/3 Pjlfc*! 

Op/dPT>M ■ • Vfc * * '^ 5 ' '^/’^^‘’ 7 'lc 


+ Ta/V 


After the first end second partials of the fit errors and Jj 
given b/ Sqs. (5.3.6 - 5.3.10) are computed with respect to the parame- 
ters the parameters S,. Sj and are adjusted by the following 


P 


Pj - (3*jj/3P^)'j (^Ji/ap'j 


j*l 


(5.3.13) 


and the parameters P3. P^. P5 P7 “® adjusted by 


S .l3\/3p^r] (3vap)j 


J+l *^3 


(5.3.14) 


The information content of the data base is considered W be ^e=p.ate 
for adjusting the paramarers if the determinants of ( 3 J^/a P ) ond 
(-3^3 /35^>. are higher than lO"®. This number was determined, as in the 
longlLdlnal case, by examining the variation of the two determinants on 
the real time computer during steady-state operation of the system with 
no input signal applied. -Another test performed by the convergence cri- 
terion was to limit the number of iterations over any data-base up to 8. 
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The fit errors Jj^ and were checked to be lower than 1 and the gradient 

vectors <Sjj^/3p) and Ojj/dp) were checked to have Euclidean norm 
-4 

lower than 3 x 10 • Finally , the maximum allowable covariance of all 

the lateral parameters p^^ through p^ was set to be 200. This number was 
chosen experimentally using the real time computer at NASA, v/hen these 
tests were satisfied, the peurameter vector p was passed to the Learning 
Al^rithm Subsystem (LAS) . 

For the lateral application of the LAS, the relationship betv/een 

A 

the parameters as a function of mach number M and altitude L was 
represented by second order polynomials. This relationship is given by 


PjCM^L) 

P3(M,I.) 

PgCM,’) 

PgCM,L) 

P^OmL) 


+ CjM + 




S3 * 


S9 " 


Ss ^ Se'* 


Si * S2‘"‘ 


S? * Ss” 


C,L + C.M L ♦ C.M^ + 

3 4 5 6 

C,L + CjjH L * 

♦ 'is'- * <=1S'-' ^ 

♦ =21^ ♦ =22« ^ ♦ =23“* ♦ =24'-^ 

♦ =27*- * =28*’ ^ 

♦ =33^ * =34" ^ ♦ =35^’^ ♦ <=36‘* 

♦ =39^ ♦ C«j" ♦ =4l"^ ♦ =42‘* 


(5.3.15) 


The LAS then produces the coefficient vector C the iterative application 
of Eqs. (3.4.14) and (3.4.16). The obtained C^'s are then passed to the 
Memory and Control Process Subsystem (IK7S). 

The design criterion selected for the lateral characteristics of 
the Dutch roll mode was to maintain the damping coefficient of 0.7 of 
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critical dampiog Independent of frequency. To meet this criterion the 
rudder control was chosen in the following fora: 

- K^r ♦ (5.3.16) 

Substituting Eq. (Sc3.l6) into the last t>.e> equations of Eq. set (5.3.1) 
yields 


r - (p^ ♦ Pt^>^ P 3 / * P7^4^s 

- (- 1 ♦ a P 7 K^)r + + a 


(5.3.17) 


The characteristic equation of the system described by Eq. (5.3.17) is 
obtained as 


.A 


f ♦ PyKj) 


1 * ap^K3 


A 

- Pi 


A 

s - p^ 


0 


(5.3.18) 


or 


,A A A ^ A 

- Cp^ ♦ Py + PyK^yS + p. 


(1 - a^,K,) 


(5.3.19) 


therefore 

2j- - - (p^ ♦ P 5 + P^K^yz/p^TlT^^) (5.3.20) 

Taking the square of both sides of Eq. (5.3.20), substituting for 
^ - fz/2 =0.7 and rearranging terms yields 

♦ 2p, fp^ ♦ Pjd ♦ a)] ♦ (p^ » pj>^ - 2pj - 0 (5.3.21) 

The solution of Eq. (5.3.21) for that provides positive damping is 
tzOcen, and this comprises the control law for the Dutch roll mode. For 


108 


the lateral characteristics of the roll node, the design criterion was to 

keep the roll node tinie constant at 0.2 sec* and to maintain uniform ma* 

( 61 ^ 

n^ver effectiveness of the lateral stick in producing roll rate • 

To meet this criterion, the aileron control ties chosen i» the following 
form 

- Kg p + kg/^ (5.3.22) 

Substituting £q. (5.3.22) into the first equation of Eq. set (5.3.1} yields 
P * (?! ♦ PgKg) p + $2^ + (5.3.23) 

For this first order system the time constant is given by 

r « - V(pj^ ♦ PgKg) - 0.2 (5.3.24) 

Solving Eq. (5.3.24) for Kg yields 

Kg » - (5 + Pj)/P5 (5.3.25) 

The lateral stick control effectiveness is required to be 5, therefore 

Kg - 5/Pg (5.2.26) 

By this choice, the ma:cimum lateral stick deflection will produce 
2)7 rad/sec. roll rate^^^\ £qs. (5.3.25) and (5.3.26) comprise the 
control law for the roll mode. The way to evaluate the control law 
which consists of the solution of Eq. (5.3.21) and Eqs. (5.3.25) and 
(5.3.26) is to substitute for the parameters using Eq* (5.3.15). 

The Cji^'s used in Eq. (5.3.15) are the current contents of the memory and 
M and Ij are the values of mach number and altitude for which the gain 
scheduling is computed, respectively. In the next section wc present and 
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discuss the results obtained for the lor< 9 ltudinal and the lateral cases* 
5*4* Results and Disc us sion 

The learning control system vas applied to the six-degcee-of- 

i 

freedom real-tline sinulation of the OFEr/ aircraft on NASA, Langley* s 
CDC €600 computer* 

The results shown in Pigs* 5*1 - 5*3 illustrate the learning of 
p^t P 2 and p^ of the longitudinal dynamics at sea level altitude over 
a range of Mach numbers M ■ .3 to M a *9 based on continuous flight 
as indicated by the Mach number and altitude time histories, shoiei in Pig* 
5*4 (a,b) • The associated angle of attach, pitch rate, elevator command, 
and pilot's command are shown in Pig. 5*4 (c,d,e,f) . Figure S.4 (g,h,i, 
j ,;k) shoiva the time histories of the variables associated with the con- 
vergence criterion during the same flight course. The convergence crite- 
rion consists of the monitoring of the values of the performance index 
(ffig. 5*4 (g))f the determinant of the information matrix (Pig* 5*4 (h)) • 
arid the three diagonal elements of the information matrix after it has 
been inverted (Pig* 5*4 (i,j,!<))* VJhen the determinant of the information 
matrix is lov/ there is no information in the processed data base* In the 
portions of time when the determinant has high values, only then can we 
iterate through the Newton-Raphson algorithm and this corresponds to some 
control activity as indicated by the pilot command and the elevator com- 
mand* Itote that the diagonal elements of the inverse of the information 
matrix have low values when the determinant is high and high values when 
tlie determinant is low, as would be expected* The performance index has 
low values except at the final portion of time where the states of the 
aircraft are highly excited as shown by the angle of attack time history. 
This indicates that the model does not match the plant dynamics for big 


perturbations since the niodel describes onl/ some linearized modes of 
th« dynamics* Note that the descending steps of the performance index 
result from the updatir4g of the parameters done in the Nevrton-Raphson 
elgorithn* 

The learned curves shown in Pigs. S.5 - 5.7 illustrate the learn- 
ing of p^» P 2 end of the lateral dynamics at sea. level altitude over 
a range of I!ach numbers M * .3 to M • .6880 bvased on contimuous flight 
asi indicated by the r-'.ach number and altitude tine histories shovm in 

Fig. 5.8 (a.b). The associated states of the Dutch roll mode, sideslip 

i 

angle and yaw rate, are shovm in Pig. 5.8 (c,d) , and the rudder command 
is shown in Pig • 5.3 { e) . The roll rate and the aileron command are 
shown in Pig. 5.3 (f,g). Finally, Fig. 5.8 (h) shov/s the determinant of 

i 

the information matrix. *Note that the determinant has high values only 
during periods of control activity. Although the flight was continuous, 
learning of the parameters has occurred only for particular flight con- 
ditions selected by the convergence criterion when it was satisfied. 

The learned curves shovjn in Figs. 5.10 - 5.13 describe the 
learning of the longitudinal parameters p^, Pj, and p^ as a function 
of both, altitude and mach numbei:* (L,K) over the entire flight envelope. 
These curves are also based on continuous flight as shown in Pig. 5.4 
(a,b) . Note that the grid interval of those three dimensional curves is 

5.000 ft. for altitude and .05 for mach number. Another interesting ob- 
se^ation is that the intersection of the three dimensional curves with 
the plane L ■ 0 corresponds to the two dimensional curves shown in Figs. 

5.1 - S.3. 

The learned curves shown In Figs. 5.14 - 5.20 describe the learn- 
ing of the lateral parameters Pj^ through over the entire flight 
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envelope based on continuous flight as shown in Fig. 5.8 (a,b). 

2b ozrder to evaluate the performance of the learning control 
systenif we computed from the real-time simulator the values of the 
p^tial derivatives of the moment with respect to angle of attach oC , 
pitch rate q and elevator deflection t , M ^ ). The time 

histories of these partial derivatives during a flight as indlfcated by 
the nmch ntimber and altitude in Fig. 3.21 (a.b) are shown in Figs. 

5.21 Ch,i, j) . The variation of the partial derivative of the lift with 
respect to angle of attach o< (L^ ) during the same flight is shown in 
Pig. 5.21 (h) . The associated learned parameters (p^, p^. p^ and p^) 
are shoim in Figs. 5.21 (d,eyf,g). It is interesting to note that the 
learned parameter p^^ does not have the spikes depicted by the real-time 
since these spikes are due to elevator variations and in our learn- 
ing control system we restricted p^ to be a function only of nach nu m ber 
and altitude. Another observation is that the cost finction (J) shown 
in Fig. 5.21 (c) has high values initially , meaning that the model is 
misaligned substantially from the plant. This is true if we note that 
P 2 initially has positive values vdiere the real-time is negative. 

As soon, as an identification occurs p., has negative values and the cost 
function (J) is reduced. 

The same comparisons are possible also for the lateral caia. 


but were not carried out due to enormous additional amount of simulations . 


In the next chapter we summarize the development of the learning 

control system as presented in this dissertatite and present the final 

:! . 

conclusions* 
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Fig. 5.4 contliiued: (l,j,)c) the three diagonal elements of the Inverted Information matrix. 
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Pig. 5.5 LflTR, PI VS. MflCH 
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Fig. 5.8 Real-ti.ne simulation time histories of the lateral dynamics of F*8 DFBW aircraft 


(a) mach number (b) altitude (c) sideslip' angle. 
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Pig. 5^9-^ - Short-^Perlod_^Itoot Locus for Pitch Ra te Feedback Gain Variations 
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Pig. 5.17 


LflTR. 


P4 VS. MRCH, ALT IT. 















Fig. 5.21 Comparison of the learned parameters with the P-8 DFB^ 

aircraft longitudinal parameters during a real-time simulation, 
(a) mach number (b) altitude (c) performance index 
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Pig. 5.21 Continued: ^ 

(d) Learned longitudinal parameter (e) 
longitudinal parameter p-, (f) learned longitudin^ 

parameter p, (g) learned longitudinal parameter p^ 
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Fig. 5.21 Continued: if 

(h) Real (F-8) longitudinal parameter | 

(i) Real (F-8) loncitudinal parameter p^ (M^ ) “l| 

(j) Real (F-8) longitudinal parameter p^ (H^ ) || 

Oc) Real (F-8) longitudinal parameter p^ (L^ ) 
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CHAPTER 6 

SUrS^lARY AND GDNCLUSI(»fS 

6.1« Summary 

This dissertation describes one of the many possible ways to develop 
aii adaptive learning control system and its application as a flight con- 
trol system for the P-8 DPBV/ aircraft. This leeumlng control system blends 
the gain scheduling and adaptive control into a single system that has the 
advantages of both. 

One important feature of this adaptive learning control system lies 
in its edjility to adjust the gain schedule in a prescribed and learned 
manner to account for changing plant operating characteristics. Another 
important feature of the presented adaptive learning scheme is that one 
needs to identify the plant's parameters only at selected operating condi- 
tions in order to obtain reasonable adjustments for the gain schedule over 
a large range of operating conditions. This is done by the use of the 
coefficients, C^s, (weights) produced by the learning algorithm subsystem 
and stored in the memory. The gain schedule for every possible operating 
condition is then determined by the weights. In this fashion, the leam- 
iiig control system makes practical the real-time computation of the gain 
schedule. Another feature of such an approach is that it may be imple- 
mented with sufficiently inexpensive technology to make use in control 
systems to be economical for a wide variety of industrial applications. 

The fxinctional organization of the learning control system with 
a general description of the task of each subsystem was presented in 
Chapter 3. More specifically, sections 3.2 and 3.3 describe two differ- 
ent mathematical techniques to implement the information acquisition 
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isubsystenit one based on Liapunov's direct method and the second based on 
Nlewton«Raphson method* Ibe next two sections (3,4 and 3.5) describe 
mathematical techniques to implement the learning algorithm subsystem and 
the memory and control process subsystem respectively* 

Perusal of the existing literature about adaptive learning control 
systems and flight control systems was discussed in C3iapter 2. Section 
2*2 introduces a popular class of adaptive systems, namely the parameter 
adaptive model reference systems and discusses mathematical techniques to 
implement the adaptive algorithm based on Liapunov's function and on the 
gradient approach. Section 2.3 examines the present state of learning 
control systems and their applications and also presents the most promi- 
nent mathematical techniques to formulate a learning system. The advance 
ments of flight control systems, as a result of the evolution of digital 
flight control systems, are examined in section 2.4. Section 2.5 gives 
particular reference to the MASA P-8 Digital Fly-By-VJire (DFB*.-/) program* 
Qiapters 4 and 5 give the application of the learning control 
System to control t\-;o different simulations of the P-8 dynamics, and 
present the associated results* I4ore specifically, in Chapter 4 we use 
fourth order simulations of the longitudinal and lateral dynamics of the 
P-3 DPBV7 aircraft and the model used for the learning control system is 
of the sane order. The application considered in Chapter 5 is more re- 
alistic since it was done in real-time for the piloted six-degree-of- 
freedom simulation of the P-8 DFB17 aircraft* Because of the real-time 
constraint* only the predominant motion modes were modeled, using first 
or second order analytical models* 

In the next section, final conclusions are drawn about the 

* as discussed in Chapter 5* 
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pe'Cforwance of the learning control system as evidenced from the research 
carried out in this dissertation and the LCS's applicability to model and 
control various other systems is also discussed* 

6*2. Conclusions 

The aircraft flight envelope used for the learning control system 
design is shovai in Pig. 6.1. Symbols (o) indicate flight condition 
points in the flight envelope for which our models were developed. 

The symbols on the horizontal line L » 0 = sea level indicate mach 
numbers .5, .7 and .9. At these flight conditions the aircraft was 

simulated to obtain the parameter curves shown in Chapter 4, Figs. 4.2-4.13. 
Anj important feature of the LCS, as evidenced in these curves, is that we 
obtain information about the parameters for different flight conditions 
which the aircraft has not experienced. This indicates that once the 
aircraft is in a nev flight condition, we may adjust the gain scheduling 
with no need to identify the p 2 urameters for this new flight condition. 

This, in turn, shows that a dither signal to excite the system is not 
needed during periods of control inactivity, where in conventional adap- 
tive schemes we would have to perturb the system at each different flight 
condition. 

In Appendix A , the figures illustrate the learned parameter curves 
as a function of altitude obtained from the flight conditions Indicated 
in Fig. 6.1 on the vertical line M » .7 (L • 0 » sea level, 20,000 ft., 
40,000 ft.) . Even though the curves shown in Appendix A were obtained 
from different flight patterns than those shown in Chapter 4, Figs. 4.2-4.13^ 
the corresponding parameters have close values for the same flight condi- 
tions* By this, we may conclude that the performance of the LCS does not 
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depend on the particular choice of flight conditions experienced by the 
ajJTcraft; i.e* corresponding parameters are the same independent of the 
selected flight conditions. Though not discussed in this dissertation, 
we may infer that the LCS may be generalized to incoirporate variations 
of the parameters as a function of several other variables, e.g. the 
angle of attack. 

The gain scheduling technique adjusts the gains at each different 
flight condition by having prestored the values of the parameters. This 
is practical for parameters that are functions of one to three veuriables 
hov;ever, for functions of four or more variables, the gain scheduling 
becomes impractical because of the extraordinary memory requirements. 

On the other hand, the LCS introduces a significant saving in storage 
thus requiring a memory of reasonable size and also makes practical the 
real-time computation of. the parameters and the gains by a table look-up. 

By this, v;e may conclude that the LCS may be used to control plants that 
are affected by parameters that are multi-variant functions. 

In view of the longitudinal models used in Chapters 4 and 5, 
namely Eqs. (4.3.2) and (5.2.2), one may observe a correspondence between 
the longitudinal parameters p_, p. and p_ of Eq. (4.2.1) and the longitu- 

J o 

dinal parameters p^^, P 2 and p^ of Eq. (5.2.2) respectively. (See Table 6.1) 

The comparison of Figs. 4.3 & 5.1, 4.2 & 5.2 and 4.4 &. 5.3 show 

that the curves have the same approximate shape but the corresponding 
values are different. This may be attributed to tMO different reasons: 

(1) In Chapter 5 the simulation of the plant describes the entire six- 
degree -of -freedom dynamics of the P-8 DFBV/ aircraft while the plant simu- 
lation used in Chapter 4 describes only the linearized longitudinal & lateral 
dynamics. (2) In Chapter 5 we only modeled the short period longitudinal 

ORIGINAL PAGE Ife 
DE POOR QUALITY 
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mode, the Dutch roll mode and the roll mode, where In Qiapter 4 the model 
Was more elaborate* This simplification was done because of the real-time 
constraint* . 

The correspondence of the lateral parameters used in Chapters 4 

i 

and 5 may be observed by comparing £q,(4*5.2) xvith Eq. (5.3.1) and is 
?(iven in table 6.2. The figures 6.2 and 6.3 Indicate the learning of p^, 
and p ^2 of Chapter 4, respectively and are included here for the sake of 
Comparison. The comparison of Figs. 5.5 & 6.2 and 5.7 & 6.3 show again 

I 

different values of the parameters for corresponding flight conditions, 
which is attributed to the two reasons described above. 

For this reason, we do not have a v/ay to evaluate the curves ob- 
tained from the real-time simulation so we will evaluate the performance 
of the LCS according to how the design criteria are met, namely 0.7 
d^tmpting ratio for the Dutch roll and short period longitudinal modes, 
and 0.2 seconds time constant for the roll mode. During the time inter- 
vals when the LCS is engaged, its performance can be observed. Figure 
644 shows the response of the sideslip angle /3 to a rudder input. This 
figure indicates that when the LCS is engaged at times 30 sec., 60 see., 

80 sec. and 110 sec. the Dutch roll mode damping ratio is maintained at 

i 

'i 

0.7. Figure 6.5 shows the response of roll rate p due to an aileron in- 
pujt. By comparing the steady-state roll rate at times 20 sec., 60 sec., 
80j sec. and 100 sec. the uniformity of the roll-rate response during en- 
gagement of the LCS may be noticed. By this, we may conclude that the 
LC0 is capable to meet design requirements over the entire flight enve- 
lope, without the need to prestore or identify the parameters at each 

• See adapter 5 
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flight condition as would be required by the gain scheduling ^ or the 
conventional adaptive techniques , respectively. In the next section we 
will discuss possiblities for future research on learning control systems 
both from the theory and the applications points of view* 

6*3 Recommendations for Future Research 

Riture research on learning control systems could be approached 
from two points of viev;: a) the mathematical approach and b) the ap- 
plication approach. From the mathematical point of view, the following 
areas of research are suggested: 

!• To develop fxirther criteria for the adequacy of the learning 
control system by improving on the convergence and confidence criteria. 

2. Investigate the performance of the learning control system when 
different ftmctional representations are used to implement ^e LAS. 

3. Ebcamine the learning control system design for distributed pa- 
rameter systems. 

From the applications point of view, the following areas of 
research are recommended: 

1. Fodify the LAS to incorporate variations of the parameters as 

f 

functions of more than two variables as shown in this dissertation. 

This will depend on the particular problem at hand. 

2. Apply the learning control system to control systems such as 
chemical processes and physiological systems. 
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APPENDIX A 


LEARNED PARAI-IETER CURVES AS A FUNCTION OP ALTITUDE 
OBTAINED PROM CASE STUDY I (CHAPTER 4) 


This appendix contains the graphs of the learned parameter 
curves as a function of altitude for mach number M a 0.7 for a high 
order representation of the F-8 DFBVJ aircraft dynamics. The first 
tv:enty figures show the longitudinal parameters p^ to p^^Q obtained via 
tSvo different LCS's one using Liapunov's IAS and the other, by the use 
of Newton-Raphson IAS. The latter tv/enty eight figures show the lateral 
parameters p^ to p^^ obtained via the above two mentioned LCS's. 

These curves were obtained by simulating the operation of the 

I 

aircraft at the following operating conditions: Mach number H a 0.7 

and Altitude L > sea level, 20,000 ft., 40,000 ft. The following re« 
presentations depict the plant's learned and model parameters. 

plant's parameter curve 
learned parameter curve 
model parameter curve 
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APPENDIX B 


A DISCRETE-TEX EVALUATION OF THE TH-E RESPaiSE 
OP A CX)NTINUOUS-Tir!E SYSTEM (EULER’S .’METHOD) 


‘.sider the continuous -time linear system represented by the 
by the vector differential equation of the fona 


X a A X 


♦ bS 


(B.l) 


We wish to obtain the tine response of the system described by Ek;* 

(a.l) by utilizing a discrete-tine approximation. To do so, ue divide 
the time axis into sufficiently small time increments, each of duration 
T seconds. Then the values of the state variables are evaluated at 
successive time intervals; that is t a 0, T, 2T, 3T, ..., IcT. The 
definition of a derivative is 


X a lira jx(t t) - X (t)| /At 


(B.2) 


Since we are interested in the values of the state vector at times 
that are integer multiples of 7, we may substitute in Eq. (B.2) A t a T 
and t a JcT. This yields an approximation of the derivative given by 


f 


jx ^(k ♦ 1) T 

j - X OcT) ► 


> / T 


(B.3) 


Substituting Eq. (B.3) into Eq. (B.l) and letting t a kT we obtain 

|x ^(k ♦ 1) tJ - x (kT)| T a Ax(kT) * (JcT) (B.4) 

Solution of Eq. (3.4) for x ^(Jc ♦ 1) t| yields 
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X j^(Jc ♦ 1) tJ - (T a ♦ I) X (IcT) ♦ T qS^ (JcT) (B.5) 

Equation (B.5) may be rev/ritten as a difference equation 

X (!c + 1) - (T A ♦ I) X (k) ♦ T B X (k) (B.6) 

Equation (B.6) is the iterative operation that relates the state vector 

at the (k l)st time instant in terms of the value of x and S at 

s 

the k-th tiine instant, and it is used to implement the IAS in real-time. 

Due to the real time constraint, our study (Chapter 5) was lim- 
ited to the first order approximation of the time derivative. 
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